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Abstract  Continued 

With  uncertainty  taken  directly  on  the  noise  samples,  we 
develop  an  estimation-detection  theoretic  approach:  the  detec¬ 

tion  statistic  preserves  the  structure  of  the  quadrature  matched 
filter,  but  in  place  of  the  linear  sample  mean,  a  minimax 
robust  estimator  of  the  random  amplitude  is  substituted.  This 
test  is  shown  to  be  asymptotically  maximin  optimal  (in  the  sense 
of  Huber)  for  a  wide  family  of  decision  rules  and  for  several 
common  target  signal  models.  In  addition,  the  maximin  bound  on 
the  desired  detection  probability  is  sufficiently  large  to  be  in 
the  range  of  practical  interest. 

This  test  is  then  extended  to  handle  the  important  case  of 
unknown  power  level.  When  noise  reference  samples  are  available, 
the  extended  test  utilizes  an  adaptive  threshold,  which  serves  as 
an  estimator  of  the  variance  of  the  amplitude  estimator  under  the 
noise-only  situation.  This  version  is  shown  to  be  maximin  robust 
in  the  class  of  "Sliding-window"  tests.  Its  implementation  in 
the  nominally  Gaussian  case  is  relatively  simple,  as  it  requires 
only  o-trimmed  estimators.  When  detection  is  to  be  performed 
with  the  signal-plus-noise  observations  only,  scale  invariance  is 
achieved  by  coupling  the  "location"  estimators  with  a  robust 
scale  estimator.  Here,  only  qualitative  robustness  is  achieved. 
However,  within  various  parametrized  mixture  families,  the 
false-alarm  (detection)  probability  can  be  bounded  from  above 
(below) . 

Finally,  Monte  Carlo  simulation  results  show  that  the 
asymptotic  analysis  accurately  predicts  the  small  sample  perfor¬ 
mance,  down  to  n  =  16  samples.  Moreover,  the  proposed  tests 
substantially  outperform  the  corresponding  robust  tests  derived 
from  the  traditional  weak  signal  assumption. 
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ABSTRACT 

In  this  report,  we  study  procedures  for  robust  detection  of  slowly  fading  narrow- 
band  signals  in  nearly  Gaussian  noise,  a  common  model  for  radar /sonar  systems. 

For  the  classical  quadrature  matched  filter  test,  we  introduce  an  uncertainty  about 
the  envelope  statistic  and  develop  a  robust  test  for  this  one-sample  situation.  It  is 
demonstrated  that  this  test  is  capable  of  protecting  only  against  relatively  weak  contam¬ 
inations. 

With  uncertainty  taken  directly  on  the  noise  samples,  we  develop  an  estimation- 
detection  theoretic  approach:  the  detection  statistic  preserves  the  structure  of  the  qua¬ 
drature  matched  filter,  but  in  place  of  the  linear  sample  mean,  a  minimax  robust  estima¬ 
tor  of  the  random  amplitude  is  substituted.  This  test  is  shown  to  be  asymptotically 
maximin  optimal  (in  the  sense  of  Huber)  for  a  wide  family  of  decision  rules  and  for 
several  common  target  signal  models.  In  addition,  the  maximin  bound  on  the  desired 
detection  probability  is  sufficiently  large  to  be  in  the  range  of  practical  interest. 

This  test  is  then  extended  to  handle  the  important  case  of  unknown  power  level. 
When  noise  reference  samples  are  available,  the  extended  test  utilizes  an  adaptive  thres¬ 
hold,  which  serves  as  an  estimator  of  the  variance  of  the  amplitude  estimator  under  the 
noise-only  situation.  This  version  is  shown  to  be  maximin  robust  in  the  class  of 
“Sliding-window”  tests.  Its  implementation  in  the  nominally  Gaussian  case  is  relatively 
simple,  as  it  requires  only  a-trimmed  estimators.  When  detection  is  to  be  performed  with 
the  signal-plus-  noise  observations  only,  scale  invariance  is  achieved  by  coupling  the 
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“location”  estimators  with  a  robust  scale  estimator.  Here,  only  qualitative  robustness  is 
achieved.  However,  within  various  parametrized  mixture  families,  the  false-alarm  (detec¬ 
tion)  probability  can  be  bounded  from  above  (below). 

Finally,  Monte  Carlo  simulation  results  show  that  the  asymptotic  analysis  accu¬ 
rately  predicts  the  small  sample  performance,  down  to  n  =16  samples.  Moreover,  the 
proposed  tests  substantially  outperform  the  corresponding  robust  tests  derived  from  the 
traditional  weak  signal  assumption. 
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1.  INTRODUCTION 


1.1  Motivation  and  Outline 

The  classical  theory  of  signal  detection  is  based  on  a  complete  statistical  characteri¬ 
zation  of  the  signals  and  interference  that  are  typical  of  the  environment.  In  many  cases, 
especially  for  the  adverse  situations  in  which  radar  and  sonar  systems  operate,  this  pre¬ 
cise  modeling  is  either  prohibitive  or  impossible.  Consequently,  there  is  a  strong  motiva¬ 
tion  to  study  the  design  of  detection  procedures  which  are  robust  against  deviations 
from  the  assumed  statistical  models. 

Following  the  fundamental  work  of  Huber  on  robust  estimation  of  location  parame¬ 
ters  [1]  and  on  robust  hypotheses  testing  [2],  extensive  applications  and  further  research 
have  appeared  in  the  communication  and  information  sciences  literature,  of  which  refer¬ 
ences  [4]- [12]  are  a  representative  sample.  Inspection  of  these  references  reveals  that 
they  are  mostly  concerned  with  lowpass  deterministic  (i.e.,  unknown  but  non-random) 
signals,  except  [9]  which  treats  robust  detection  of  weak  stochastic  signals,  and  [12] 
where  results  have  recently  been  presented  for  deterministic  narrow-band  signals.  How¬ 
ever,  the  target  signals  that  are  received  by  radar-sonar  systems  (which  are  among  the 
most  important  engineering  applications  of  statistical  detection  theory)  in  typical 
environments,  are  represented  by  more  complicated  structures  than  those  studied  in  the 
above  mentioned  references.  These  would  include  slowly  fading  narrow-band  signals 
with  random  amplitude  and  phase,  or  range  and  doppler  spread  targets  [14].  While  an 
extensive  detection-estimation  theory  exists  for  these  complex  models  when  both  signal 
and  noise  are  Gaussian  narrow-band  processes,  c.f  [13]-  [15],  robust  detectors  have  not 
been  devised  for  these  signal  structures.  The  results  of  [4]- [12]  are  not  directly  applicable, 
except  for  those  of  Shin  and  Kassam  in  illj  who  proposed  a  rather  ad-hoc  robust  detec¬ 
tor  for  a  Rayleigh  narrow-band  signal. 

The  main  purpose  of  this  work  is  the  design  and  analysis  of  robust  detection  pro¬ 
cedures  for  coherent  ,  slowly  fading  narrow-band  signals  1  in  nearly  Gaussian  noise. 

'This  is  probably  the  most  important  case  for  modern  radar-sonar  systems  that  operate 
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Under  the  slow-fading  model,  the  signal  random  amplitude  and  phase  are  essentially  con¬ 
stant  over  the  observation  period;  while  their  distributions  are  assumed  to  be  known,  the 
average  amplitude  level  is  unknown.  Unlike  previous  works  [5],[7]-[l0j,  and  [12],  where 
the  objective  was  to  robustify  the  local  performance  for  weak  signals,  we  seek  a  global 
solution  which  will  be  optimal  for  any  signal  amplitude.  We  achieve  this  through  a 
different  approach  in  which  the  detection  test  statistic  is  based  on  a  robust  estimation  of 
the  random  amplitude.  (A  similar  structure  was  proposed  by  El-Sawy  and  VandeLinde 
for  the  limited  case  of  deterministic  lowpass  signals).  This  procedure  is  shown  to  be 
asymptotically  maximin  optimal  in  the  sense  of  Huber  [2]  within  a  very  wide  family  of 
decision  rules. 


Outline  Of  The  Work 

Chapter  2  introduces  the  problem  and  presents  a  performance  analysis  of  the  com¬ 
mon  (unrobustified)  quadrature  detector  under  situations  of  contamination.  Since  in 
most  practical  applications  even  if  the  noise  distribution  is  known  its  level  is  not,  the 
adaptive  threshold  version  of  the  detector  is  emphasized.  Here,  the  test  statistic  is  nor¬ 
malized  by  the  Maximum-Likelihood  (ML)  estimate  of  the  noise  level.  While  this  test 
achieves  the  desired  Constant  False  Alarm  Rate  (CFAR)  property  asymptotically,  it  is 
demonstrated  that  for  small  sample  sizes  the  false  alarm  probability  (Pfa)  can  increase 
intolerably  under  contamination,  in  addition  to  a  large  decrease  in  the  detection  proba¬ 
bility  ( Pd ). 

In  Chapter  3  we  study  a  robust  test  on  the  “coherent  envelope”  of  the  observables, 
i.e.,  on  the  output  of  a  quadrature  matched  filter  which  is  the  sufficient  statistic  under 
the  purely  Gaussian  noise  case.  For  this  scalar  statistic,  extension  of  Huber’s  [2]  robust 
test  is  straightforward.  It  consists  essentially  in  finding  the  threshold  setting  and  a  ran¬ 
domization  constant  under  a  least  favorable  noise  distribution.  However,  it  is  found  that 


in  severe  clutter  (correlated  noise)  background,  where  coherent  filtering  (“whitening”)  is 
unavoidable. 
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the  proposed  test  can  protect  against  contaminations  that  would  have  roughly  doubled 
the  Pja  of  the  unrob ustified  quadrature  receiver.  Beyond  that,  the  maximin  bound  on 
Pd  decreases  rapidly  as  a  function  of  the  contamination.  Hence,  this  test  is  rather  inap¬ 
propriate  for  radar/sonar,  where  very  low  Pfa  is  required,  and  the  percentage  contami¬ 
nation  (e)  can  be  much  larger  than  this  probability. 

In  Chapter  4  an  asymptotically  robust  test  is  proposed  and  analyzed.  The  common 
large  sample  assumption  is  an  unavoidable  necessity  in  order  to  get  explicit  functionals 
for  the  error  probabilities,  on  which  optimization  according  to  a  maximin  criterion 
becomes  possible.  However,  we  abandon  the  weak  signal  assumption  that  has  dominated 
the  literature.  By  avoiding  this  assumption,  we  hope  that  the  proposed  test  will  be  ade¬ 
quate  for  the  small  sample  sizes  typical  of  radar/sonar  systems.  Hence,  the  proposed 
test  statistic  is  derived  from  minimax  robust  ML  estimation  of  the  amplitude.  This  is 
achieved  by  applying  Huber’s  [1]  M-estimator  of  “location”  to  the  quadratures  samples, 
and  then  independence  of  the  uniform  unknown  phase  is  obtained  by  summing  the 
squares  of  these  estimates,  exactly  as  in  the  quadrature  matched  filter.  This  test  is 
shown  to  be  maximin  robust  for  sufficiently  large  values  of  the  desired  Pd  (in  the  range 
of  practical  interest)  and  for  several  common  target  signal  models.  Finally,  the  weak  sig¬ 
nal  locally  optimal  robust  detector  for  our  problem  is  outlined,  and  is  shown  to  be  a 
local  approximation  to  the  derived  M-estimation/detection  structure. 

In  Chapter  5,  further  uncertainties  that  must  be  considered  in  practical  applica¬ 
tions  are  studied.  With  regard  to  e  in  the  mixture  model  [1],  it  is  shown  numerically  that 
even  a  very  pessimistic  design  with  e=0.5  incurs  a  rather  small  additional  loss  compared 
to  the  case  when  e  is  known  exactly.  More  important,  the  robust  detector  is  extended  to 
handle  an  unknown  scale  (power  level)  of  the  nominal  noise  density  in  the  mixture  fam¬ 
ily,  by  coupling  it  with  a  robust  estimator  of  the  scale.  While  it  is  not  possible  to  exhibit 
maximin  properties  of  the  extended  test  (as  is  always  the  case  in  multi-parameter  prob¬ 
lems  [3]),  it  is  shown  numerically  to  be  qualitatively  robust  in  the  sense  that  an  upper 
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bound  on  Pfa  and  a  lower  bound  on  Pd  can  be  guaranteed  within  various  parametrized 
mixture  families.  The  latter  bound  is  quite  close  to  the  Pd  obtained  by  the  optimal 
detector  for  the  nominal  Gaussian  noise.  When  “noise  reference”  samples  are  available, 
a  similar  estimator-detector  with  an  adaptive  threshold,  which  serves  as  an  estimator  of 
the  variance  of  the  amplitude  estimator,  is  shown  to  be  maximin  in  the  class  of  “sliding 
window”  detectors.  Due  to  its  CFAR  property  for  any  noise  distribution,  the  restriction 
for  sufficiently  high  Pd  of  Chapter  4  is  relaxed. 

Moreover,  when  the  nominal  noise  is  Gaussian,  the  M-estimators  can  be  replaced  by 
a-  trimmed  estimators,  which  are  asymptotically  equivalent  in  probability  and  simpler 
to  compute.  The  same  structure  can  be  generalized  for  non-Gaussian  nominal  p.d.f. 
with  the  appropriate  optimal  robust  ^estimators.  (The  problems  of  uncertainty  in  the 
scale/ variance  of  the  nominal  density  and  in  e  have  not  been  treated  before  in  detection 
problems).  Finally,  the  signal  frequency  which  is  also  unknown  in  any  realistic  applica¬ 
tion  (  due  to  doppler  shifts),  is  treated  by  constructing  a  bank  of  contiguous  tests  which 
covers  the  uncertainty  range.  It  is  shown  that  the  detectability  loss  for  signals  whose  fre¬ 
quency  straddle  between  adjacent  such  “filters”,  is  asymptotically  identical  with  that 
incurred  by  the  usual  periodogram  (FFT)  detector. 

The  price  that  is  paid  for  the  improved  capability  is  a  substantial  increase  in  signal 
processing  complexity,  compared  to  the  linear  FFT  bank  of  filter-detector.  However,  the 
most  demanding  nonlinear  processing  required  here  consists  of  several  levels  of  data 
rankings,  and  this  operation  is  becoming  possible  for  real-time  radar/sonar  applications 
with  VLSI  and  VHSIC  technology. 

The  final  section  of  chapter  5  presents  a  thorough  Monte-Carlo  study  of  the  small- 
sample  performance  of  the  various  robust  detectors.  The  simulation  results  show  that 
the  performance  predicted  by  the  asymptotic  analysis  of  the  “sliding-window”  robust 
detectors  is  essentially  maintained  even  for  sample  size  =  16,  in  guaranteeing  a  high 
lower  bound  on  the  detection  probability  as  well  as  in  controlling  the  false  alarm-level. 
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When  noise-reference  samples  are  not  available,  somewhat  larger  sample  sizes  are  neces¬ 
sary  for  convergence  to  the  asymptotic  prediction.  In  addition,  the  corresponding  weak- 
signal  locally  optimal  robust  detector  is  shown  to  produce  high  losses  for  small  desired 
Pfa  and  large  deviation  of  the  signal  frequency.  These  losses  are  attributed  to  poor  con¬ 
vergence  to  the  Gaussian  distribution  as  a  result  of  hard  limiting  on  the  test  statistic, 
when  the  signal  is  sufficiently  large  to  allow  high  detection  probability.  On  the  other 
hand,  our  proposed  tests  limit  the  influence  of  outliers  around  the  arbitrary  signal 
amplitude,  and  converge  rapidly  to  the  asymptotic  values. 

As  a  consequence  of  the  asymptotic  analysis  and  simulation  study,  our  tentative 
conclusion  is  that  the  proposed  robust  detectors  emerge  as  quite  adequate  for  success¬ 
fully  treating  most  of  the  important  uncertainties  that  are  encountered  in  the  real-world 
radar-sonar  detection  environment. 

1.2  Review  of  Huber’s  Results  for  Maximin  Testing  of  Simple  Hypotheses 

Huber  [2]  considered  and  solved  the  following  problem.  Let  {x,-  },'l=1  be  a  sequence 
of  independent  random  variables,  and  let  {P0>Pi}  be  distinct  probability  measures  on 
the  real  line  with  the  corresponding  densities  {/ 0,/  1}  with  respect  to  some  measure. 
Assume  that  the  likelihood  ratio  (LR)  /  \{x)/  f  0{x  )  =  L  (x )  almost  surely,  where  L(x)  is 
a  monotone  function. 

Let  M  be  the  set  of  all  probability  measures  on  the  real  line  and  0<e<l  a  given 
number.  The  uncertainty  in  the  distribution  of  the  observations  is  introduced  by 
expanding  the  simple  hypothesis  F0  and  simple  alternative  P  l  into  composite  ones  by 
a  mixture  model- 

H0:  P0  —  {Q  €M  |  Q  =(l-e)P Q+eC 0,  C0SM}  (l.la) 

Hi-  Pi  —  { Q  J  Q  =(l-e)P  i+fCj,  C  i€M}  (1.1b) 

Actually,  Huber’s  setup  is  more  general,  it  allows  also  the  following  neighborhoods 
of  the  nominal  model:  total  variation,  Prohorov  distance,  Kolmogorov  distance  and  L’evy 
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distance.  Moreover,  different  e,-  are  allowed  in  the  Hi . 

The  problem  is  to  find  the  most  robust  test  in  the  maximin  sense  between  P0  and 
P1(  i.e.,  to  find  a  saddle-point  pair  of  test  d*(x)ED,  where  D  is  the  class  of  all  decision 
rules,  and  distributions  such  that 

sup  0(d,q i)  =  p{d*  ,q[)  =  inf  fi(d*  ,q)  ,  (1.2) 

<Je D  ?€ Pj 

subject  to 

sug  a(d,q)  =  a(d*,qo)  =  a0  (1.3) 

?  €Po 

Here,  f3(d  ,q)  is  the  power  of  the  test  (detection  probability)  at  density  q: 

P{d  ,q)  =  Prob  {d(x)=H1\ H i  is  true  (?GPi)}  (1.4) 

and  a(d  ,q)  is  the  level  (false  alarm  probability) 

a{d,q)  =  Prob  {d(x)=Hl  |  H0  is  true  (gGP0)}  (1.5) 

The  meaning  of  the  criterion  is  clear-  d*  is  the  best  Neyman-Pearson  (NP)  decision  rule 

for  the  lest-favorable  pair  {<7o><?i  }• 

As  Huber  showed,  the  most  robust  test  is  a  NP  test  on  the  pair 


The  numbers  L ' 


{l-e)f  0{x) 

L  (x  )<£/  ' 

(1.6) 

?0  —  ■ 

\r?-f  Ax) 

L  (x  )>L'  ' 

(!-«)/  i(») 

L{x)>L' 

(1.7) 

II 

(l-^)L'  /„(*) 

L{x)<L’ 

and  L!  1  are 

determined  such  that  q£  and  <7* 

are  legitimate  density 

functions. 


For  e  sufficiently  small  (a  condition  which  is  equivalent  to  disjointness  of  P0  and 
Pt)  the  normalizing  equations  have  a  unique  solution  with  0<L’  <*L!  '  <oo.  The  LR 
between  q$  and  q[  is  thus  given  by  “soft-limiting”  the  nominal  LR  to- 


L'  when  L(x)<.L' 

l{x,L'  ,L  '  1  )  =  ■  L  (x )  when  L  '  <L  (x  )<L '  ' 

L  '  1  when  L  (x  )>L  '  1 


(1.8) 
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n 

and  a  maximin  robust  test  is  a  randomized  NP  test  on  T  (*)=  n  n**'  ,l>  >  )•. 

1=1 


for 

T(x) 

>t 

<*'(*)=• 

with  probability  c 

for 

T(x) 

=  t 

for 

T(x) 

<t 

The  quantities  t  and  c  are  determined  from  the  right  side  equality  in  Eq.(l.3).  Note  that 
in  general  the  test  must  be  randomized  for  arbitrary  <*0  since  T  (x)  takes  its  values  at 
the  limiter  end-points  with  finite  probability. 

It  is  interesting  to  point  to  some  peculiarity  in  the  result.  The  least-favorable  densi¬ 
ties  are  constrained  by  (1.6-1. 7),  and  in  particular  the  contamination  Cx  can  not  be 
related  in  any  other  way  to  C0.  While  this  is  reasonable  in  a  game  situation  against  an 
intelligent  opponent,  it  seems  unlikely  for  signal  detection  problems  where  the  uncer¬ 
tainty  in  Pi  is  induced  by  that  in  P0  and  is  not  affected  by  the  presence  or  absence  of  an 
additive  signal  whose  characteristics  are  assumed  to  be  known.  For  example,  in  a  “loca¬ 
tion  shift”  problem  (//0:x,-  =n,- ,  ^x,-  =n,-  +a )  we  would  like  P,^  to  be  the  class  of  all 

distributions  that  are  shifted  to  the  right  by  0  from  those  in  P0,  and  specifically 
q  *(x  )=</o  (x-a ),  which  can  not  also  satisfy  (1.6-1. 7).  Thus,  it  is  suggested  that  a 
better  solution  might  exist  for  this  physical  formulation  of  the  problem;  unfortunately,  it 
is  unknown  \  The  asymptotic  reformulation  in  section  4  essentially  avoids  this  peculiar¬ 
ity. 

It  should  be  emphasized  that  Huber’s  proof  of  optimum  robustness  relies  heavily  on 
two  assumptions  which  are  not  valid  for  the  problems  that  are  considered  in  this  work: 
a)  H o  and  Hl  are  simple  hypotheses-  P 0  and  Px  do  not  include  any  unknown  (relevant 
or  nuisance)  parameters.  b)the  observations  are  independent  r.v.’s. 


1  Private  communication  with  P.J. Huber, 12/1983. 
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1.3  Review  and  Critique  of  Robust  Tests  Based  on  Weak  Signal  Assumption 

Often  in  detection  problems,  the  true  value  of  the  signal  amplitude  A  is  small  but 
otherwise  unknown.  In  such  cases  it  is  plausible,  mostly  as  a  theoretical  nicety  which 
permits  an  analytical  (and  usually  quite  simple  for  implementation)  solution,  to  employ 
the  locally  optimum  detector  (LOD)  structure.  This  is  the  detector  structure  which 
maximizes  the  derivative  of  the  power  function  (detection  probability)  at  A— 0,  for  a 
given  test  level  (probability  of  false-alarm),  [25].  Under  suitable  regularity  conditions  on 
the  densities,  the  LOD  is  identical  with  the  detector  which  is  obtained  by  taking  the 
leading  term  of  a  series  expansion  for  the  likelihood-ratio  in  powers  of  the  SNR  around 
zero  [26],  It  also  maximizes  Pitman’s  efficacy  [27]  which  is  a  suitable  weak  signal  measure 
of  performance,  when  the  number  of  the  observed  samples  n  — *oo. 

The  weak  signal  local  optimality  criterion  was  first  extended  to  problems  of  robust 
detection  by  Martin  and  Schwartz  [4],  and  has  been  widely  applied  since  then,  c.f. 
[5], [7]- [10], [12].  Instead  of  seeking  a  maximin  relation  on  the  detection  probability  as  in 
Eq.  (1.2),  it  was  proposed  in  [4]  to  design  for  an  asymptotic  (n  — *-oo)  maximin  relation 
on  the  slope  of  the  power  function  at  A=0  : 


supp  (d,q  o)  =  p  (d*  ,q  o)  =  inf  f?  {d*  ,q0) 
d€D  90eP0 


(1.10) 


d 


where  0  (d,q)—— — f3{d  ,q  |A)|_4_o  and  subject  to  the  false-alarm  constraint  of 
dA 

Eq.  (1.3).  The  interpretation  is  that  d*  is  the  LOD  for  the  least-favorable  density  q$. 
This  criterion  has  resulted  in  all  the  above  mentioned  references  in  a  limiter  -correlator 
structure:  the  locally  optimal  non-linearity  of  the  LOD  receiver  is  robustified  against  e  - 
contamination  by  inserting  a  soft  limiter  at  its  output  and  then  correlating  the  non- 
linearly  transformed  observations  with  the  known  signal  sequence.  This  will  be  denoted 
in  the  following  as  LORD  -  LO  Robust  Detector. 


The  performance  of  LOD  schemes  is  commonly  evaluated  using  asymptotic  meas¬ 
ures  such  as  the  Asymptotic  Relative  Efficiency  (ARE),  which  requires  both  the  assump- 


-  9  - 


tions  of  large  n  (to  satisfy  the  central  limit  theorem)  and  vanishingly  weak  SNR.  In  any 
practical  engineering  application,  the  number  of  samples  must  be  finite  1  and  thus  the 
input  SNR  must  be  reasonably  large  to  obtain  meaningful  detection  probability.  It  fol¬ 
lows  that  LOD  schemes  might  perform  poorly  in  practical  applications.  Several  exam¬ 
ples  of  peculiar  and  very  slow  convergence  of  the  finite  sample  RE  to  the  ARE,  can  be 
found  in  [28]- [31].  These  references  include  examples  where  the  finite-sample/large-signal 
ranking  of  detector  performance  are  actually  different  from  ARE  prediction.  Another 
new  example  is  given  in  Appendix  A.  The  large  SNR  performance  of  the  LOD  would  be 
extremely  poor  when  the  test  non-linearity  redescends  or  even  vanishes  except  for  some 
region  around  the  origin,  i.e.  /(x)=0V|  z  |  >c  .  In  these  cases  we  will  obviously  get 

lim  /?— ►  0,  in  contrast  to  the  desired  consistency  of  the  power  function  with  increasing 

A>>c 

SNR.  This  situation  was  actually  obtained  in  [10].  In  principle,  this  undesired  property 
of  LOD  designs  might  be  corrected  by  switching  between  two  detectors,  where  one  of 
them  is  the  LOD  and  the  other  one  is  some  amplitude-consistent  detector.  The  switching 
should  occur  as  a  result  of  a  threshold  crossing  by  an  estimator  of  the  signal  amplitude 
A  .  This  estimator  must  obviously  be  robust  against  deviations  in  the  assumed  noise 
model  to  prevent  incorrect  switching.  While  this  heuristic  proposal  has  not  been 
analyzed  ,  it  does  suggest  that  an  optimal  robust  detector  should  be  based  on  a  robust 
amplitude  estimator,  as  will  be  studied  in  section  4. 

Inherently,  the  LORD  scheme  is  subject  to  the  same  consistency  problem.  In  addi¬ 
tion,  investigation  of  the  previously  mentioned  references  reveals  that  they  all  exhibit 
one  or  more  of  the  following  peculiarities  and  shortcomings: 

1  In  radar-sonar  systems,  n  is  directly  related  to  the  total  search  time  of  the  desired  sec¬ 
tor,  to  the  desired  maximum  non-ambiguous  detection  range,  and  to  the  spatial  resolution 
of  targets.  Moreover,  the  detectability  of  coherent  signals  is  (asymptotically)  governed 
by  the  average  integrated  SNR  ~nA  2/2<t2.  Since  transmitters  are  usually  constrained 
by  the  average  rather  than  peak  power,  it  is  only  the  product  nA  2that  matters  and  im¬ 
proved  detectability  can  be  obtained  by  increasing  A  as  well  as  n.  Hence,  in  view  of  the 
other  system  design  goals  that  were  mentioned  in  the  beginning  of  this  note,  n  is  usually 
in  the  range  1-100,  and  even  in  1-3  samples  for  very  long  range  systems,  in  contrast  with 
the  common  theoretical  assumptions. 
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i)  The  support  of  the  contaminating  density  c  in  Eq.  (1.1)  is  restricted  to  lie 
exclusively  on  the  exterior  of  some  interval  [-Xi,Xi]  ,  c.f.  [9].  This  might  not  be  a 
severe  limitation  since  the  greatest  deterioration  in  performance  of  the  conventional 
detectors  occurs  for  contaminations  that  are  in  the  far  tails  of  the  nominal  density. 

ii)  The  nominal  density  itself  is  not  a  member  of  the  mixture  family  P0  in  which  a 
least-favorable  density  is  sought,  but  rather  “close”  to  it  in  some  measure,  c.f.  [12]. 

iii)  The  test  is  maximin  robust  only  for  a>a(e),  which  is  roughly  in  the  range  0.05-0.2, 
c.f.  [4], [5],  and  [7].  Hence,  no  robustness  is  guaranteed  for  the  most  important  range 
of  small  a:  lO^-lO-8  ! 

It  is  clear  that  the  maximin  relation  (1.10)  is  neither  sufficient  nor  necessary  for 
obtaining  the  desired  maximin  solution  in  terms  of  the  detection  probability  itself.  Since 
Eqs.  (1.2)  and  (1.3)  are  contradictory  in  nature  when  A  can  take  any  value  in  some  inter¬ 
val,  the  most  that  can  be  expected  from  the  LORD  approach  is  as  shown  in  Fig.  1.1:  the 
shaded  region  on  the  /3  axis  indicates  the  cFs  obtained  for  the  optimal  detector  for  any 
density  q0  in  the  mixture  family;  the  slope  of  any  f3(q 0)  at  A=0  agrees  with  (1.10)  and 
(1.2)  is  satisfied  for  all  signal  amplitudes  that  are  greater  than  some  critical  value  : 
A  ~>AC  or  equivalently  /3>/3c  . 

However,  the  power  function  of  a  LORD  could  as  well  behave  as  shown  in  Fig.  1.2 
where  (1.3)  and  (1.10)  are  satisfied,  but  the  curve  of  0(q q)  dominates  that  of  /S(q0)  V 

A  >0  and  V? o^Po-  (Other  situations  are  also  possible).  Nevertheless,  if  \fq oGPo  the 
distance  between  the  curves  in  Fig.  1.2  could  be  made  sufficiently  small,  i.e.- 
I  ,q d  *  >q o)  |  ) — ^O,  the  detector  would  be  practically  robust,  though 

not  in  the  most  general  maximin  sense.  (In  that  case,  q£  actually  becomes  the  most- 
favorable  density  for  the  detection  probability  !) 

Closer  study  of  the  structure  of  all  the  LORD  tests  in  the  above  mentioned  refer¬ 
ences  reveal  that  implicit  in  all  of  them  is  a  “robust  estimator  of  zero  amplitude”  -i.e. , 
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the  test  is  actually  based  on  an  approximation  of  the  leading  term  in  the  expansion  of  an 
appropriate  version  of  Huber’s  robust  estimator  in  a  power  series  around  zero  amplitude. 
This  observation  establishes  another  motivation  for  the  asymptotically  robust  test  of  sec¬ 
tion  4  which  utilizes  robust  estimation  of  the  amplitude  without  any  small  signal  approx¬ 
imation,  and  thus  essentially  avoids  all  of  the  above  mentioned  problems. 


I 
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0  AC  A 


Fig.  1.1  Qualitative  power  function  curves  for  a  desired  maximin  robust  detector 
of  signal  with  unknown  amplitude. 


0  A 


Fig.  1.2  Qualitative  power  function  curves  for  a  locally-optimal  robust  detector 
which  is  not  maximin  with  respect  to  the  detection  probability. 
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2.  THE  NARROW-BAND  SIGNAX  AND  NOISE  DETECTION 
ENVIRONMENT 


2.1  Models  and  Problem  Statement 

The  problem  to  be  considered  is  that  of  robust  detection  of  a  slowly  fading 
narrow-band  signal  with  unknown  phase  and  amplitude  in  nearly  Gaussian  narrow-band 
noise.  This  signal  model  is  typical  of  most  coherent  radar  targets  [13]  and  also  appears  in 
some  sonar  problems  and  also  in  communication  over  tropospheric  links  [14]. 

Specifically,  the  discrete  signal  plus  noise  samples  that  are  observed  at  the  systems 
input  terminals  are 

Zi  =  As{  cos(o;t,-  +<!>)  +  rii  ,i  =  l,...,n  (2.1) 

The  «,■  are  narrow-band  noise  samples  around  the  known 1  center  frequency  u  : 

ni  —  nc  cos ut  -  n8  sincjf  (2.2) 

{nc,ns}  are  assumed  to  be  zero  mean,  independent 1  2  and  identically  distributed  (i.i.d) 

random  variables  (r.v.),  from  a  nominally  Gaussian  e  -contaminated  mixture  : 

/  (nCJn9)  =  n  /  o (»<:,)/  oK)  (2-3) 

i=l 

where 

1  -a-2 

/0(nc,  =  x)  =  /  oK,  =  x)  =  (l-e)-^==^exp(— )  +  ec(ar)  ,c£M  (2.3a) 

<j>  is  the  unknown  signal  phase  which  is  common  for  all  samples  (coherent  detection) 
and  uniformly  distributed  over  [0,27r].  It  does  not  convey  any  information  on  the  target 
and  is  thus  a  nuisance  parameter  which  has  to  be  averaged  out.  {s,-  },n=1  is  a  sequence  of 
known,  positive,  finite-energy  amplitude  modulations  3.  The  amplitude  A  which  is  also 


1  The  case  of  unknown  doppler  shift  will  be  treated  later  by  the  usual  “bank  of  filters” 
approach. 

Notice  that  sample  -to  -sample  independence  either  excludes  the  common  situation  of 
detection  in  correlated  clutter,  or  it  corresponds  to  a  sub-optimal  scheme  where  the  ob¬ 
servables  are  obtained  by  pre-  whitening  the  input  sequence. 

3  Known  phase  modulations  can  be  introduced  without  much  complication. 
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constant  for  all  the  observed  samples  will  be  considered  in  the  sequel  as  either  deter¬ 
ministic  but  unknown  or  random  with  some  known  distribution  of  unknown  scale.  4 

It  is  well  known,  c.f.  [13],  that  in  the  purely  Gaussian  noise  case  (e  =  0),  a  uni¬ 
formly  most  powerful  (UMP)  test,  independent  of  the  amplitude  being  deterministic  or 
random  with  an  arbitrary  distribution,  is  equivalent  to  a  threshold  test  on  the  “coherent 
sample  envelope”  R  (x,y)  : 

«(*.*)-(  £**«)’ +  (£**)*  (2.4) 

1=1  1=1 

where  the  in-phase  and  quadrature  samples  are  obtained  by  applying  the  input  to  a  pair 
of  lowpass  mixers: 

Xi  =  2 Zf  cos u>t  j  ipp  =  As{  cos</>  +  nc_  (2.5) 

y{  =  22,-sinc ot  |  LPF  =  Ast-sin</>  +  (2.6) 

Since  the  quadrature  components  are  obtained  by  a  reversible  operation,  they  are 
equivalent  statistics  for  the  problem,  which  can  thus  be  reformulated  as: 


H0. 


xi  =  nc, 

Vi  =  \ 


(2.7a) 


x,-  —  nc<  +  A  s,.  coscj) 
Vi  ~  na,  +  As,  sin^ 


(2.7b) 


It  is  desired  to  obtain  a  maximin  relation  like  Eq.  (1.2)  for  all  A  >AC  (recall  the  discus¬ 
sion  in  section  1.3)  subject  to  the  false-alarm  constraint  Eq.  (1.3). 

Direct  application  of  Huber‘s  [l]  solution  for  this  problem  is  inappropriate  since  the 
samples  under  are  actually  dependent  due  to  the  common  random  phase  <j>,  even 
when  A  is  deterministic  and  known.  However,  a  “naive”  extension  of  Huber's  test  is  pos- 


4  These  slow  fading  assumptions  correspond  to  common  radar  targets  whose  cross  section 
(RCS)  fluctuates  with  a  correlation  time  much  longer  than  the  “blip”  duration  nT,  T= 
intersample  period,  but  much  shorter  than  the  scan-to-  scan  period.  This  is  a  most  fre¬ 
quent  situation  [13]. 
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sible  if  one  considers  <j>  as  a  non-random  but  unknown  and  adopts  an  “estimation- 
detection”  procedure.  A  set  of  M— *■  oo  tests  is  performed,  where  the  test  is  matched 
to  <f>j  =j  /2irM  and  its  objective  is  a  decision  about  the  presence  of  a  signal 
of  phase  Each  of  the  individual  tests  is  then  of  the  correlator  -limiter  type  found 
in  [4],  but  the  limiter  break-points  are  now  functions  of  the  S;  and  also  of  <j>j.  Since 
phase  information  is  irrelevant,  the  final  decision  can  be  reached  by  accepting  Hi  if  at 
least  one  of  the  individual  tests  has  accepted  it. 

We  have  not  studied  how  many  parallel  tests  have  to  be  performed  to  achieve 
acceptable  performance  with  arbitrary  signal  phases  between  the  adjacent  “filters”.  It  is 
conjectured  that  it  might  be  prohibitive,  especially  when  the  doppler  shift  is  also  unk¬ 
nown  so  that  a  two-dimensional  “filter-bank”  (phase,  frequency)  has  to  be  constructed. 

In  the  absence  of  any  optimal  procedure  of  design  for  the  finite  sample  problem, 
we  suggest  to  weaken  somewhat  the  requirements.  We  will  do  that  in  section  3  by 
assuming  that  the  observable  is  only  the  envelope  statistic  of  Eq.  (2.4)  rather  than  the 
original  quadrature  samples.  We  can  then  construct  a  most  robust  test  against  the 
uncertainty  in  the  distribution  of  that  scalar  statistic. 

In  section  4,  we  will  extend  the  asymptotic  estimation-detection  approach  of  El- 
Sawy  and  VandeLinde  [6]  to  our  problem,  utilizing  some  of  Huber's  [I]  results  on  robust 
estimation  of  a  location  parameter.  It  is  shown  that  if  A  is  replaced  by  a  robust 
minimax  estimate,  the  resulting  test  statistic  has  a  limited  robust  property. 

2.2  Performance  Degradation  of  the  Conventional  Envelope  Detector. 

In  this  section,  the  degradation  in  performance  of  the  envelope  detector  of  Eq. 
(2.4),  (which  is  the  UMP  detector  for  slow  fading  narrow-band  signals  in  narrow-band 
Gaussian  noise)  will  be  analyzed  when  it  operates  in  a  background  which  comes  from  a 
Huber-Tukey  mixture  family.  The  decrease  in  detection  probability  as  well  as  the 
increase  in  false  alarm  probability  are  of  interest,  where  the  latter  is  of  an  extreme 
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importance  in  an  automated  search  system.  These  results  will  serve  as  a  basis  for  com¬ 
parison  with  the  performance  of  the  robust  detectors  that  are  developed  later  in  this 
work. 

Radar  and  sonar  systems  usually  operate  in  a  highly  non-station  ary  and  non- 
homogeneous  noise  background.  Except  for  the  receiver  thermal  noise  which  is  station¬ 
ary  and  Gaussian,  the  statistical  properties  of  interference  that  is  the  outcome  of  exter¬ 
nal  sources,  (such  as  clutter  reflections  from  ground  sea  or  aerial  objects  in  the  radar 
environment,  or  ambient  noise  and  reverberations  in  the  sonar  case  [14],  [22]),  rapidly 
change  in  time  and  space.  Therefore,  even  when  the  Gaussian  assumption  is  adequate, 
the  fixed  threshold  detector  based  on  the  envelope  statistics  of  Eq.  (2.4)  is  of  little  value, 
since  its  actual  false-alarm  rate  will  fluctuate  intolerably  according  to  the  changes  in  the 
background  noise  level.  A  common  adaptive  detector  for  these  situations,  which  is 
invariant  to  the  power  level  of  the  background  noise,  compares  the  envelope  statistics 
from  the  “test-cell”  with  an  adaptive  threshold  derived  from  a  “noise-reference”  chan¬ 
nel.  Specifically,  {X,Y}  represent  the  narrow-band  observation  samples  in  the 
hypothesis-testing  cell,  {U,V}  are  the  noise-reference  samples  which  are  assumed  to 
have  the  same  distribution  as  that  of  {  X,Y  }  under  H0.  The  adaptive  test  is: 

d  (x,y,'u.v)=iJ !  iff  R(x,y)>tW(u,v)  (2.8) 

where 

R  —I2-\-Q2 ;  f(x)=-E^;  Q(y)=-f>,-  (2-9) 

n 1=1  n  1 

1  M 

W'(u,v)=-L.£(t//+V'/)  (2-10) 

(2U) 

ni= 1  ” i =1 

The  threshold  multiplier  t  is  determined  as  to  achieve  the  desired  false  -alarm  rate. 
(Notice  that  there  are  M  reference  vectors  for  the  test  vector,  all  of  dimension  n). 

In  search  systems  where  a  target  presence  is  to  be  detected  in  some  spatial  sector, 
the  noise  reference  samples  are  easily  obtainable  by  applying  the  same  signal  processing 
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of  the  hypothesis  testing  channel  to  adjacent  resolution  cells  in  range,  doppler  or  bearing 
coordinates,  c.f.  [18]- [20],  This  detector  is  known  as  the  Cell-  Averaging  CFAR(CA- 
CFAR)  ,  Mean  Level  Detector(MLD),  or  Sliding  Window  Detector  (SWD).  When  the 
number  M  of  reference  samples  grow  asymptotically,  its  performance  approaches  that  of 
the  UMP  detector  for  the  fixed-variance  Gaussian  noise.  Its  structure  is  also  almost 
identical  with  that  of  a  narrow-band  version  which  can  be  derived  from  the  Two- 
samples^. e.,  noise- reference)  Students-t  detector  [21],  which  has  some  optimum  proper¬ 
ties  among  unbiased  and  invariant  tests  for  detection  in  Gaussian  noise  of  unknown 
level.  (In  the  extended  t-test,  the  noise  variance  is  estimated  from  both  the  test  and 
reference  samples  in  contrast  to  (2.8).  However,  the  difference  in  performance  is  quite 
small  when  M  is  sufficiently  large,  and  the  structure  of  Eq.  (2.8)  enables  “sliding- 
window”  (SW)  operation  where  R  as  well  as  Uf+V?  are  generated  sequentially  by  the 
same  hardware  during  the  search). 

In  the  following  we  assume  that  the  background  interference  has  an  epsilon- 
contamination  mixture  density,  where  the  nominal  as  well  as  the  contaminating  densities 
in  Eq.(2.3)  are  Gaussian:  /  — (l-e)iV(0,l)+£./V(0,c2).  The  performance  of  the  detector  is 
discussed  first  when  the  number  of  samples  n  is  asymptotically  large,  and  secondly  for 
the  finite  sample  case. 

a)  n  — *oo 

When  n  is  asymptotically  large,  all  the  quantities  I,  Q,  Uj ,  and  V j  of  Eqs.(2.9)- 
(2.11)  are  Gaussian  random  variables.  Hence,  it  is  obvious  that  this  detector  is  asymptot¬ 
ically  nonparametric  under  the  null  hypothesis;  i.e.,  it  is  CFAR  for  any  probability  den¬ 
sity  of  the  input  test  and  reference  samples  for  which  the  central  limit  theorem  holds. 

Under  the  alternative,  for  a  Rayleigh  distributed  slow-fading  signal  the  detection- 
probability  of  this  detector  is  uniquely  determined  by  the  integrated  Signal-to-Noise 
ratio  (c.f.  [18],  [20]):  SNR  =tiA2/2<72.  Since  cr2— (l-e)+ec2  ,  it  is  clear  that  even  a  small 
amount  of  high  power(c2)  contamination  can  reduce  the  effective  SNR  by  orders  of 
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magnitude,  thus  spoiling  detectability  completely. 


b)  Finite  n 

With  regard  to  the  detection  probability  a  similar  behavior  as  above  is  to  be 
expected,  as  it  is  not  significantly  affected  by  the  tails  of  the  distribution.  However,  it  is 
no  longer  true  that  the  detector  remains  CFAR  for  moderate  and  small  sample  sizes. 
Since  in  practice  radar-sonar  systems  are  designed  for  very  low  false  alarm  probabilities, 
it  is  the  tails  of  the  distribution  of  the  test  statistics  that  counts,  and  even  a  slight  devi¬ 
ation  from  Gaussianity  of  the  input  samples  will  suffice  to  substantially  increase  the  area 
under  the  tails. 


In  Appendix  B,  an  approximation  is  derived  for  the  distribution  of  the  coherent 
envelope  R,  which  approaches  the  desired  one-sided  exponential  distribution  for  large 
n.  1  The  approximation  is  based  on  an  expansion  of  the  distribution  in  a  series  of  ortho¬ 
normal  Laguerre  polynomials,  where  its  leading  term  is  (with  proper  normalization  such 
that  E(R)=1)  the  Gamma  density 


/(*)= 


aae~aR  R  a~x 


r(«) 


R  >0 


(2.12) 


where  a=(l+A:/2n)  1  and  k  is  the  kurtosis  of  the  input  samples.  This  is  given  for  a 
Gauss-Gauss  mixture  by 


k 


E(z') 

£V) 


-  3  =  3e(l  -  e) 


c2-l 
1  -  e  +  ec2 


2 


(2.13) 


Notice  that  when  e  — *■  0  but  ec2  »  1,  k  — ►  3/e  »  1.  Table  2.1  demonstrates 
that  k  can  be  very  far  away  from  zero  for  a  Gauss-Gauss  mixture,  thus  a  is  also  much 
smaller  than  1  leading  to  higher  false-alarm  probability. 


1  Actually,  an  exact  analytic  expression  was  derived  for  the  special  case  of  a  Gauss- 
Gauss  mixture,  but  it  was  found  to  be  very  difficult  to  compute  numerically. 
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€  C 

_ 

i  m 

31.62 

0.001 

0.02 

0.1 

mb 

24.3/. 568 
73. 5/. 303 
22.37.589 

748. 5/. 041 
245.4/. 115 
26.477.547 

Table  2.1.  Kurtosis(left  entry)  and  a  (right)  ,  Gauss-Gauss  mixture,  n=16. 

Figures  2.1  thru  2.5  depicts  the  false  alarm  probability  vs.  the  threshold  multiplier 
t,  where  t  is  set  according  to  the  Gaussian  assumption.  In  each  figure,  the  graphs 
marked  “known  sigma”  corresponds  to  the  adaptive  threshold  detector  of  Eq.(2.8)  with 
M  — kx>;  in  that  case  the  variance  estimation  is  error-free  and  the  detector  is  identical  in 
performance  to  the  UMP  detector  for  the  known  variance  case.  The  graphs  marked  “ 
£=0”  give  Pja  when  the  input  is  indeed  Gaussian.  In  all  figures,  n=16  and  M=8.  This 
is  sufficiently  small  for  poor  convergence  to  Gaussianity  of  the  sample  means  in  the  tails, 
but  is  representative  of  the  number  of  samples  actually  employed  in  most  systems.  The 
adaptive-threshold  curve  with  e=0  was  computed  from  Pfa  =(l+t  /M)~m  ,  c.f.  [18],  [20]. 
The  Laguerre  approximation  was  computed  only  for  the  M  — >-oo  case,  since  for  the  adap¬ 
tive  threshold  test  no  similar  approximation  is  possible.  2  The  squares  were  obtained 
from  a  Monte-Carlo  simulation;  it  is  observed  from  the  figures  that  the  Laguerre  approx¬ 
imation  is  quite  reasonable  and  agrees  closely  with  the  simulation  results  in  its  region  of 
low  variance( roughly  the  reciprocal  of  the  number  of  runs  in  the  simulation).  The  main 
conclusion  from  the  figures  is  that  the  increase  in  Pfa  is  very  high,  for  desired  values  of 
1(T4  and  lower  whenever  a,<0.9,  and  that  this  deterioration  is  ordered  in  correspondence 
with  the  deviation  of  k  and  a  from  their  values  at  the  Gaussian  density.  This  increase  in 
Pfa  is  much  more  severe  for  the  adaptive-threshold  detector  with  finite  number  of  refer¬ 
ence  cells;  in  Fig.  2.5,  for  example,  the  threshold  setting  for  10-8  at  the  Gaussian  pro¬ 
duces  about  3T0-2  at  the  contaminated  mixture. 


2  Since  it  is  not  possible  to  obtain  analytic  expressions  for  the  moments  of  the  ratio  R/W 
of  Eq.  (2.8).  While  it  is  possible  to  derive  a  similar  approximation  to  the  distribution  of 
W  and  then  to  integrate  numerically  Pr{R>tW},  this  approach  was  not  taken. 
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Fig.  2.4  false -a)  arm  probability  vs.  threshold  multiplier.  Normal-Normal  ftg.  2.5  false -alarm  probability  vs.  threshold  multiplier.  Normal-Normal 
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3.  ROBUST  TEST  ON  THE  SAMPLE  ENVELOPE 


When  non-contaminated  hypotheses  contain  nuisance  (vector)  parameters  0,  with 
known  distribution  they  are  actually  simple  hypotheses,  and  the  optimal  test  is  a 
Neyman-Pearson  test  on  the  likelihood  ratio  of  the  averaged  densities: 

L  (x)  =  £»,[/  ,(x  |  0,)1/.E„J/  „(x  I  «„)]  (3.1) 

An  attempt  to  incorporate  averaging  into  the  proof  of  optimality  of  Huber’s  test  fails 

due  to  the  fact  that  the  marginal  density  is  in  general  not  a  product  of  densities  of  the 

n 

components  z,- :  E 9[f  (x  |  0)]  7^  n  E  g[f  (z,-  |  (?)],  unless  n  =  1.  When  n  —  1,  the 

i  =1 

structure  of  Huber’s  test  extends  to  the  case  where  there  are  nuisance  parameters;  the 
only  correction  needed  is  to  replace  L  (z )  of  section  1.2  by  L  (z  )  =  E s\f  x(z  |  9)\/ /  0(z ), 
provided  it  is  a  monotone  function  of  x . 

Therefore,  in  the  absence  of  an  optimal  procedure  for  the  design  of  a  robust  test  on 
the  original  observables  {z,- ,  y,  }/Lx  of  Eq.  (2.7),  it  is  proposed  to  weaken  the  require¬ 
ments  and  to  consider  instead  a  most  robust  test  on  some  scalar  statistic.  Since  for 
e  =  0  the  sample  envelope  R  (x,  y)  =  (J]  z,)2  -I-  (£]  yt)2  is  a  sufficient  statistic  for  the 
problem,  and  its  distribution  is  independent  of  the  phase,  it  is  natural  to  try  to  robustify 
a  test  based  on  R  . 

Thus  we  will  consider  the  modified  problem: 

po  =  {tf  (#  )  €  M  |  q(R  )  —  (1  -  e'  )  f  0(R  )  +  ('  c0(R),  c0  6  M} 

PX  =  {<?  (R  )  €  M  |  q  (R  )  =  (1  -  c'  )  /  ,(R  )  +  e'  c  X(R  ),  c  x  €  M} 

where  e1  is  the  amount  of  contamination  on  the  envelope  R. 


If  one  knows  how  to  find  6,-  of  the  original  observables  (by  physical  reasoning,  estima- 
tion  or  guessing),  he  should  be  able  to  apply  his  procedure  for  finding  e 1  .  Nevertheless, 
it  can  be  verified  that  if  the  uncertainty  in  the  measure  is  given  by  a  mixture  model,  non¬ 
linear  operations  on  random  variables  do  not  change  e  and  summation  of  n  i.i.d.  contam¬ 
inated  r.v’s  results  in  £„  =  1  -  (1  -€,•)",  from  which  we  get  d  — 

l-(l-Ei)2" 


-  24  - 


We  now  apply  this  approach  to  Gaussian  contaminated  quadrature  noise  com¬ 
ponents  and  Rayleigh  distributed  signal  amplitude.^  It  is  well  known  [13]  that  the  nom¬ 
inal  densities  are  given  by: 

f  i{R  |  A  )=<tr2  exp[-cr£2(R  +n2A2)\I0(A  \/R  jo2)  ,  R  >0  (3.2) 

/  y{R  )=Ea  [f  t(R  I  A  )]=[<rJ(l+5„  )]-1exp[-R  /<rg(l+Sn  )]  ,  R  >0  (3.3) 

where  <T2—Efo(nc2)=Efo(n2),  aR=2n  <t2,S=Aq  /2cr2  is  the  input  signal  to  noise  ratio 
under  the  nominal  conditions,  Aq  —E(A2)  and  Sn  =nS . 

The  least  favorable  pair  is  given  by  Eqs.(l. 6-1.7),  with  /  x(i? )  from  (3.3)  and  f  0(R) 
by  the  same  expression  with  Sn  —  0,  and 


L(R  )=(l+Sny1exp(KR)  ,  R  >  0,  K  =Sn  /<rR(l+Sn  ) 


(3.4) 


As  L  (R )  is  monotone,  an  equivalent  test  statistic  is 


T(R)  = 


„  2  _  / 
aR  r 

R 


R  <  <tr  r ' 

<rRr'  <  R  <  aRr'  '  (3.5) 

R  >  aRr  ' 

where  the  limiter  breakpoints  are  found  by  solving  the  normalization  equations  (C.2-C.3) 
of  Appendix  C.  These  equations  yield 


2  '  ' 
aR  r 


exp^  (1+SB)  1]-KH-5„)"1exp[r/  Sn{l+Sn)  1][l-exp(-r/  )]=(l-c;  )  1  (3.6) 

l-exp(-r/  1  )+(l+5„  )exp[-r/  '  Sn  (1+SB  )“1]exp[-r/  1  (l+5n  )_1]==(l-e/  )_1  (3.7) 

It  is  shown  in  Appendix  C  that  the  resulting  maximin  test  can  take  three  different 
forms,  depending  on  e  and  the  desired  a0.  Roughly,  when  e/a0  is  large,  it  is  a  random¬ 
ized  test  where  Hi  is  decided  with  probability  c  <1  if  R  >t .  For  intermidiate  values  of 
e/cco,  it  is  a  deterministic  threshold  test  on  R  ,  and  for  small  values,  H0  is  decided  with 
probability  c,  if  R  <t .  Equation  (3.7)  is  explicitly  solved  by 

This  correspons  to  Swerling  I  target  model  [17],  which  is  a  very  good  description  of 
the  radar  cross  section  (RCS)  fluctuations  for  microwave  frequencies,  where  the  target  size 
is  much  larger  than  the  wavelength  so  that  the  quadrature  components  are  due  to  sum¬ 
mation  of  many  independent  reflectors  and  the  central  limit  theorem  holds  -  [13]. 
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r"  =ln[5„(l-e'  )/e'  ]  (3.8) 

while  solutions  for  the  first  one  can  be  found  numerically  from  the  set  of  solutions 

{o  <  1}  of 

Sn  a  +a_5*=(H-5n  )(l-e'  )-J  ,  a=exp[-r/  (1+SJ-1]  (3.9) 

Table  3.1  shows  the  values  of  the  “sufficiently  small”  ec  1  for  which  P0  and  Px  are  dis¬ 
joint  and  the  maximin  test  exists  [V  (e'  )<£/  1  (e1  ),Ve'  <ec'  ).  These  are  given  as  a 
function  of  the  integrated  signal  to  noise  ratio  Sn ,  the  natural  measure  of  distance 
between  the  hypotheses  for  this  problem. 


Sn 

lcr2 

KT1 

1 

10 

102 

103 

3.5  KT* 

4  ■  1(T2 

.2 

.42 

.49 

.498 

.5 

Table  3.1  Values  of  critical  contamination  ec '  vs.  integrated  nominal  signal-to- 
noise  ratio  S„  —  nE  [A2) /2a2. 

A  suitable  measure  of  performance  of  a  robust  test  is  how  far  is  the  lower  bound  on 
the  power  /3(d* ,  q*),  from  the  power  of  the  Neyman-Pearson  test  for  e=  0  ,  /?0 .  For 

our  problem  (30  =  a q^1  +  S"\  and  using  (3.7)  with  (C.7)  and  (C.15)  of  Appendix  C,  we 
get 


/5(d*,  q*)  = 


(W  )*./<!  +  *.>  (an_e,  )■/<■  +  *.)  <J>0_ 

i  + 

sj(i  +  sn)  5„a0  (3‘10) 


«o 

1  +  5n 


1-e' 


e  > 


1  +  Sn 


The  different  expressions  correspond  to  cases  b)  and  c)  in  the  appendix.  The  last  one  is 
valid  for  those  desired  values  of  a0  when  the  limiter  is  “effective”  and  one  must  resort  to 
a  randomized  test,  which  in  turn  causes  faster  decrease  of  the  power  curve.  Case  a)  of 
the  appendix  is  never  applied  in  this  Rayleigh  signal  example,  as  it  was  found  numeri- 
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cally  that  it  corresponds  to  false-alarm  probabilities  higher  than  0.5. 

Figures  3.1  -  3.2  depict  fi{d*  ,q*)  versus  e'  for  different  values  of  e'  and  Sn  .  All 
the  graphs  are  clearly  characterized  by  a  sharp  “knee”  at  e'  ^a0Sn  /(1+S„ )  c*0,  such 

that  when  e1  is  larger  than  this  critical  value  the  power  of  the  test  deteriorates  rapidly. 
Notice  that  even  an  increase  of  orders  of  magnitude  in  the  effective  signal  to  noise  ratio 
Sn  does  not  help  to  alleviate  the  problem.  The  figures  also  show  the  false  alarm  proba¬ 
bility  of  the  optimal  detector  for  the  uncontaminated  case  ,  when  it  actually  operates  in 
a  worst-case  contamination  under  H0:  a(N.P.  ,w.c.)=a0(  1-e'  )-t-e/  .  (Under  Hl  the 
worst  case  power  is  (l-e'  )/30  which  is  insignificantly  lower  than  f30  for  the  small  e'  s  con¬ 
sidered). 

Thus  we  conclude  that  the  proposed  test  can  protect  against  contaminations  which 
would  have  roughly  doubled  the  false  alarm  probability  of  the  Neyman-Pearson  test 
(e'  <C2o’0).  Beyond  that,  this  protection  is  achieved  at  an  intolerable  price  of  decrease  of 
the  power.  In  radar  applications,  <*0  is  very  small  and  its  variation  is  significant  only  for 
an  order  of  magnitude  changes.  Thus  the  test  proposed  in  this  section  is  not  very  useful 
in  many  practical  situations. 

A  somewhat  similar  robust  detector  for  a  Rayleigh  signal  in  nearly  Gaussian  noise 
was  recently  studied  by  Shin  and  Kassam  [11].  Motivated  by  insight  gained  from  the 
structure  of  various  robust  detectors,  they  suggested  inserting  a  limiter-squarer  non¬ 
linearity  at  the  outputs  of  the  in-phase  /=X/Z>  an^  quadrature  Q  =Y]Vi  channels, 
summing  the  outputs  and  comparing  to  a  threshold.  The  amount  of  limiting  was  optim¬ 
ized  numerically  and  the  performance  was  analyzed.  Comparing  Fig.  6  of  [11]  with  Figs. 
(3.1)-(3.2)  here,  it  is  observed  that  the  scheme  of  [11]  is  somewhat  better.  However,  the 
numerical  analysis  of  [11]  was  carried  out  only  for  the  range  e  <  a0,  and  it  is  not  known 
whether  the  performance  degrades  further  when  e  is  orders  of  magnitude  higher  than 
a0S3^  At  the  least,  this  comparison  demonstrates  that  it  is  advisable  to  “push  back  the 
^  Private  communication  with  the  authors. 
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limiting”  (or  more  precisely,  an  adequate  censoring  of  the  outliers  as  will  be  clear  from 
Chapter  4)  as  far  as  possible  from  the  threshold  comparison  point  to  the  original  observ¬ 
ables. 


Figure  3.1  a)  False  alarm  probability  that  would  have  resulted  when  worst-case  con¬ 
tamination  is  applied  to  the  UMP  test  for  purely  Gaussian  noise,  vs.  e.  b)  Maximin 
bound  on  the  detection  probability  vs.  e.  a0  =  1CT* 


a(  NP.W.C.) 
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Figure  3.2  a)  False  alarm  probability  that  would  have  resulted  when  worst-case  con¬ 
tamination  is  applied  to  the  UMP  test  for  purely  Gaussian  noise,  vs.  e.  b)  Maximin 
bound  on  the  detection  probability  vs.  €.  ckq  =  I0-2 
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4.  ASYMPTOTICALLY  MAXIMIN  ROBUST  TEST 

4.1  Introduction 

In  this  section  a  test  will  be  derived  which  is  asymptotically  (n  — ►  oo)  most  robust 
in  a  maximin  sense.  In  a  following  section,  we  will  show  by  means  of  simulation  that 
this  test  maintains  its  performance  even  for  small  sample  sizes. 

Utilizing  some  of  Huber’s  [l]  results  on  robust  estimation  of  a  location  parameter, 
El-Sawy  and  VandeLinde  [6]  derived  an  asymptotically  most  robust  test  for  the  problem 
of  detecting  a  completely  known  signal  in  additive  noise  of  uncertain  distribution.  A 
similar  approach  is  possible  (and  turns  out  to  be  optimal)  for  the  problem  formulated  in 
Section  2.1.  We  observe  the  similarity  between  the  narrowband  slowly  fading  coherent 
signal  and  the  lowpass  deterministic  signal:  for  a  given  received  sample  {X,  Y},  A  cos <j> 
and  A  sin <j>  are  essentially  unknown  location  parameters  for  the  quadrature  signal  com¬ 
ponents.  Moreover,  in  purely  Gaussian  noise  the  unknown  phase  (which  is  a  nuisance 
parameter)  is  “averaged  out”  in  the  UMP  test  statistic  R  =  I2  +  Q2,  where  I  and  Q 
are  the  sample-means  of  the  in-phase  and  quadrature  components  of  the  narrowband 
observations.  I  and  Q  are  the  Maximum-Likelihood  estimates  of  the  component  loca¬ 
tions  and  thus  R  is  a  good  estimate  of  the  amplitude  (squared)  which  is  the  “true” 
parameter  of  the  problem,  i.e.,  the  one  that  distinguishes  between  H0  and  Hx. 

Building  further  on  insight  gained  from  the  UMP  statistic  R ,  we  observe  that  in 
the  purely  Gaussian  case  the  detection  probability  /3=EA  E ^Prob  {R  >t  \  A  ,<f>}  is 
analytically  tractable  as  a  result  of  the  unbiasedness  and  Gaussianity  of  /  and  Q  under 
both  hypotheses.  Since  in  order  to  exhibit  a  maximin  property  for  a  composite  testing 
problem,  it  is  necessary  to  obtain  analytical  expressions  for  the  error  probabilities  func¬ 
tionals,  it  appears  natural  to  preserve  the  structure  of  the  R  test,  but  to  replace  the 
sample-mean  estimates  with  robust  estimates  of  the  locations  {A  cos 4>,  A  sin<^>},  which 
will  have  the  following  properties.  The  estimates  should  be: 
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i)  Shift  invariant  (or  unbiased)  for  all  n  . 

ii)  Efficient  and  consistent. 

iii)  Asymptotically  Gaussian. 

iv)  Possess  the  minimax  property  on  the  estimation  variance  (which  is  translated 

into  the  effective  SNR  when  iii  is  valid). 

v)  Maintain  the  above  properties  for  medium  and  even  small  n  . 

Huber’s  M-estimator  [lj,  and  the  cx-trimmed  mean  estimator  (when  the  nominal 
p.d.f.  is  Gaussian)  have  all  these  properties.  This  facilitates  the  proof  of  maximin 
optimality  of  the  proposed  test  when  the  amplitude  is  non-random.  For  random  ampli¬ 
tudes  which  are  of  more  interest  in  practical  applications,  the  expectation  of  the  detec¬ 
tion  probability  over  the  amplitude  must  be  taken.  It  turns  out  that  for  the  most  fre¬ 
quently  used  target  model  in  radar  applications  (Rayleigh  amplitude  or  Swerling  case  I), 
the  maximin  property  of  the  proposed  test  is  only  “almost”  preserved  (recall  Fig.  1.2), 
but  it  is  maintained  exactly  for  a  higher  order  chi-squared  amplitude  model.  (Rayleigh  is 
first  order  while  constant  amplitude  is  the  limit  of  the  chi-square  family  when  the 
number  of  degrees  of  freedom  tends  to  infinity). 

For  the  sake  of  continuity  and  convenience  in  the  exposition,  we  will  begin  with  the 
non-random  amplitude  case,  and  then  move  to  the  more  realistic  random  amplitude 
models.  Throughout  this  section  the  amount  of  contamination  e  in  the  Huber-Tukey 
model,  as  well  as  the  variance  of  the  nominal  p.d.f.,  are  assumed  to  be  known.  These 
strong  assumptions  which  are  inherent  in  previous  work  on  robust  detection  [4]- [12],  but 
are  rarely  satisfied  in  practice,  will  be  relaxed  in  Section  5. 

4.2  Non-random  Signal  Amplitude 

Before  presenting  the  test,  we  need  some  definitions.  Define  a  class  of  functions  'F 
on  R  1  such  that  {L  ,  l  }  €  ¥  if: 


(4.1) 
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1)  L  is  convex,  symmetric  about  the  origin  and  strictly  increasing  for  positive  argu¬ 
ment. 


2)  l{x)  —  dL  / dx  is  continuous. 

3)  for  all  q  6  P0,  0  <  Eq  [l\x)}  <  oo. 


4)  for  all  q  €  Po,  -rr  Eq  ix  -  0)]  exists  and  is  nonzero  at  6  =  0. 

uu 


Define  the  estimates  by  the  following  implicit  equations 


^  *(/ )=Arg  {  £  l (x,— S,- A,  )=0}  ,  A y(l )=Arg  {  £  /  (y,- ~si  Ay  )=0}  (4.2) 

i =1  i=l 

(^4X  and  Ay  are  the  robust  M-type  estimates  [1]  for  the  location  parameters  A  cos <f>  and 
A  sin(/»  of  the  observables  in  Eq.(2.7b),  for  a  given  realization  of  the  random  variables  A 
and  <f>.)  Define: 

Tn(lo)  =  [A  ,('„)]*  +(i,(i„)l!  (4.3) 

where 


l oix)  =  ~d  [log  ? o  ix )]/ dx  (4.4) 

and  go  €  Po  minimizes  Fisher’s  information: 

^(?o)  <  A?)  =  /  [-£  log  g(x)]2  q{x)dx,  V  g  €  P o  (4.5) 

Consider  a  threshold  test  on  T„  (l0) 

f  H i  ,  Tn(l0)>tn(l0)=t(l0)/n 

dn{x>  y )  \  H0  ,  otherwise 

and  let  T(l )  be  any  other  test  based  on  -Ax(/)  and  Ay(l)  with  /  6  'F  but  L  ^  -log  q§. 
The  following  proposition  states  the  asymptotic  maximin  properties  of  the  test  dn*,  which 
we  will  subsequently  refer  to  as  the  SSQME  (Sum  of  SQuared  M-type  Estimates)  test. 

Proposition  4.1  Consider  the  detection  problem  defined  on  the  observables  of  Eq.  (2.7) 
with  the  uncertainty  in  the  independent  quadrature  noise  samples  obeying  an  e-  mixture 
model  P0  with  symmetrical  p.d.f.  When  A  is  nonrandom  and  0  is  uniformly  distributed 
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on  |0,  27r],  among  the  class  of  all  tests  T (l )  based  on  Ax(l)  and  Ay(l)  of  Eq.  (4.2)  with 
/  6^,  there  exists  a  least-favorable  p.d.f.  q o  G  Po  and  a  constant  0  <  /3C  <1  such 
that  the  SSQME  test  d ^  defined  in  (4.6)  with  n  — ►  oo  achieves: 

sup  3(d,qo)  =  0(dZo,qo)=  inf  /3(d^,  q)  (4.7) 

def(l)  q  €  P0 

whenever  /3(d^  ,  <7o)  ^  0c  an<^  subject  to 

sup  a(d,q)  =  a{d^,qo)  =  a0  (4.8) 

d  €  T(l) 
q  6  P0 

Moreover,  when  the  nominal  p.d.f.  in  P0  is  Gaussian,  the  nonlinearity  /0  is  a  sym¬ 
metric  soft  limiter  whose  breakpoints  depend  on  the  noise  variance  but  not  on  the  signal 
amplitude: 


l0{x  /<j2,-K  /a, K  /a)  = 


K(e)/a 
x  jo2 
-K{e)/a 


x  >  K  (e)a 

-K(e)a  <  x  <  K(e)a  (4.9) 

x  <  -K  (e)c t 


with  K  (e)  and  e  related  by 


/  (t))/K (t)  -  *(-A: (<))  =  e/2(l  -  e) 


(4.10) 


and  f  oix)  ~  d$(x)/dx  is  the  standard  normal  p.d.f. 


Comments 

1)  Note  that  the  theorem  shows  the  saddle-point  pair  of  p.d.f.  and  test  only  among  a 
certain  class  T  (l ).  We  would  like  the  left  relation  in  (4.7)  to  be  valid  for  any 
hypotheses  test  on  the  original  observables  {X,  Y}.  Unfortunately,  this  could  not 
be  proved,  unlike  the  case  of  a  deterministic  signal  in  6),  as  will  be  clear  in  the 

sequel.  However,  the  class  T(l)  is  quite  large.  As  limiting  cases  it  contains  the 

sample  means  of  X  and  Y  when  e  =  0^,  the  sample  medians  (e  =  1)  and  with 

Thus  the  SSQME  test  is  better  (at  least  asymptotically)  than  the  “scalar”  envelope 

test  which  was  derived  in  the  preceding  section. 
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weak  regularity  conditions  which  usually  holds  [1]  Ax  and  Ay  can  be  any  transla- 

n  n 

tion  invariant  statistics,  such  as  A  x=  a,-  provided  £]aj=l  are  the 

t=l  i =1 

rank  ordered  samples  x^  <  x^)  <  <  ),  and  similarly  for  Ay. 

2)  The  theorem  remains  valid  (but  l0  given  by  a  different  expression  than  (4.9))  for 
any  other  description  of  the  uncertainty  in  the  noise,  provided  the  asymptotic  vari¬ 
ance  of  the  M-estimates  (4.2)  computed  with  / 0  is  the  maximum  achievable  over  the 
class  of  densities  in  P0.  An  example  is  the  P  family  [6].  Others  can  be  found  in 

[1],  [3]- 

3)  In  contrast  to  Huber’s  finite  sample  size  test,  this  asymptotic  test  satisfies  a  more 
desired  and  natural  optimality  criterion.  Namely,  the  maximin  relation  for  the 
power  Eq.  (4.7)  is  specified  in  terms  of  the  uncertainty  in  P0  alone,  allowing  the 
uncertainty  in  Px  to  be  induced  by  that  in  P0  as  a  shift  of  the  densities  of  the  qua¬ 
drature  samples  (recall  the  discussion  in  Section  1.2). 

4)  Extension  to  unknown  frequency  is  straightforward  by  constructing  a  parallel  bank 
of  SSQME  tests,  the  iih  best  being  matched  to  u>,-  =  i/nT  where  T  is  the  sam¬ 
pling  period.  This  will  be  discussed  in  more  detail  in  section  5.3.  The  individual 
test  is  shown  pictorially  in  Fig.  4.1.  Note  that  a  routine  for  solving  for  a  zero  of  a 
function  is  needed. ^  Thus,  a  real  time  implementation  in  a  radar  or  sonar  system 
calls  for  substantially  larger  number  of  calculations  than  encountered  in  the  more 
common  detection  systems  that  incorporate  linear  filtering,  FFT  and  at  most,  pass¬ 
ing  the  samples  through  a  nonlinearity. 


^  This  is  true  except  for  the  case  where  the  nominal  p.d.f.  of  the  uncertainty  family  is 
Gaussian,  where  a  considerable  simplification  is  possible  as  will  be  shown  later. 


Figure  4.1-  Block-diagram  structure  of  SSQME  test  for  robust 
detection  of  slow-fading  narrowband  signal. 


Proof  of  proposition  4.1  We  need  to  study  the  asymptotic  distribution  of 
T(l),  I  6  It  can  be  derived  using  the  following  lemmas  from  [1],  with  a  slight 
modification  from  [6j. 

Lemma  1  Whenever  /  £  ♦  and  q  is  any  symmetric  p.d.f.,  conditioned  on  A  and  ©, 
\fn  (Ax(l)-Ax)  and  \fn  (Ay(/)-Ay)  are  asymptotically  distributed  as  normal  r.v.’s  with 
zero  mean  and  variance  V (l ,  q  ) 


where  the  parameters  {Ax  ,Ay )  are  the  true  values  of  the  locations  (0  under  H0, 
Ax  =A  cos©  and  Ay  =A  sin<^  under  H{). 
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Lemma  2  Let  q  £  P0  be  an  e-contaminated  mixture  family  where  the  nominal  p.d.f.  /  0 
is  symmetric  and  twice  continuously  differentiable,  such  that  -log  /  0  is  convex  on  the 
convex  support  of  F 0.  Whenever  l  £  if,  there  exists  q$  £  P0  which  minimizes  Fisher’s 
information  (4.5)  over  P0  and  an  /0  £  if  given  by  (4.4).  Thus,  the  asymptotic  variance 
of  the  M-estimates  computed  with  /0  at  the  least  favorable  p.d.f.  q$  satisfies  the 
minimax  relation 

sup  V(l0,  q)  =  V{10,  g„)  =  l//(<?o)  =  inf '  V(l,  qj)  (4.12) 

9  €  F0  l  €  * 

where  q  g  is  given  by 


q  o(»)  =  (1  ~€)' 


f  a(xo)eK{X  "2o)  *  <  *o 

f  0{x)  x0  <  x  <  x1 

fo(*iWK{X~Zl)  X  ^  x' 


(4.13) 


and  x0  <  xx  are  the  endpoints  of  the  interval  where  |  /  0'  / /  o  I  <  AT,  and  A"  is 
related  to  e  through 


/  fo{*)dx  + 


/  o(^o)  +  /o(*i)  1 


(4.14) 


K  1-6 

*0 

We  note  that  the  right  inequality  in  (4.12)  is  a  consequence  of  Ax(/0)  and  Ay(l0) 
being  the  maximum  likelihood  (ML)  estimates  of  the  corresponding  location  parameters, 
when  the  underlying  p.d.f.  is  q$,  under  both  hypotheses.  In  addition,  they  are  efficient 
under  the  assumed  conditions  on  i.e.,  they  achieve  the  Cramer-Rao  lower  bound 
^_1(?o)-  The  left  inequality  follows  from  the  fact  that  q$  minimizes  I(q)  over  P0. 
Relaxation  of  the  symmetry  requirement  and  other  noise  uncertainty  models  are  treated 
in  [1],  [3]  and  [6]. 


From  Lemma  1  and  the  mutual  independence  of  X  and  Y,  Ax(/)  and  Ay(i)  are 
independent  and  jointly  Gaussian  when  4>  is  fixed.  Averaging  over  <f>  is  straightforward 
with  transformation  to  polar  coordinates  and  yields 
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f(Ax(l),Ay(l)\A)  =  Et\f(Ax(l),  A,(/)M,*)/  Ml  =■•■  (4.15) 


n 

T{1)  +  A2 

A  Tll\l) 

2 vV(l,  q)  6XP 

2  V{l,q)/n 

1 0 

V(l,  q  )/n 

T(l)  £  A?  (/)+  Ay  (/) 


where  /q(-)  is  the  zeroth  order  modified  Bessel  function.  Also, 


f(T(l)\A)  =  *f(Ax(l),Ay(l)\A)  (4.16) 

Now,  restricting  ourselves  to  the  class  of  tests  based  on  Ax(l)  and  Ay(l)  as  the 
new  observables  of  the  problem,  the  likelihood-ratio  is  A(AX(/ ),  A  Jl )  |  A  )= 
exp(-nA  2/2  V)  I Q[nATl!\l)/  V\  which  is  monotone  increasing  in  T (l )  for  any  A  ,  deter¬ 
ministic  or  random.  Therefore,  in  view  of  the  Neyman-Pearson  Lemma  T(l)  is  a 
sufficient  statistic  and  a  threshold  test  based  on  it  is  UMP  for  H A  >0  vs. 
Hq:  A  =  0. 


Unlike  the  known  lowpass  signal  case  that  was  treated  in  [6],  it  can  not  be  proved 
here  that  T  (/ )  is  a  sufficient  statistic  of  the  original  observables  {X,Y}.  There,  where 
the  additive  signal  was  non-random,  use  was  made  of  theorems  by  Wald  [23]  and 
Chernoff  [24]  that  a  test  based  on  the  ML  estimate  0  (i.e.,  solution  to  [x{  -9)=0  with 
l=(-logf  )'  6^)  is  asymptotically  equivalent  to  a  NP  test.  In  our  problem,  T(/0)  is  not 
the  ML  estimate  of  A2  when  the  underlying  quadrature  noise  p.d.f.’s  are  q q  '3|  .  How- 

2 

ever,  its  expected  value  is  E(T(l))  =  A2  +  —  V{l,q).  Thus,  it  is  asymptotically 

n 

unbiased  and  the  variance  of  T(Iq)  has  a  minimax  property  by  virtue  of  Eq.(4.12). 
Indeed,  it  can  be  shown  that 

Uar(n1/2T(/))  =  4A2V{1,  q)[l  +  —  V(l,  9  )]  4  A2V{l,q) 

n  n  »  A  2 


Hence,  T1,/2(/0)  is  a  good  robust  estimate  of  A  ,  and  as  was  discussed  before,  the  class 


w 


The 


MLE 


of 


is 


given 


by 


solving 


— —  log  E  n  q$  (ij—  A  cos  <f>)  q  g  (y,-  -  A  sin  <^)]  =  0,  which  is  analytically  in- 
d A  i  —  l 

tractable  unless  q q  is  identically  Gaussian. 


{  r  (/ )}  is  quite  large. 


Having  shown  that  in  the  restricted  class  of  tests  the  optimal  procedure  must  be  a 
threshold  test  on  the  envelope  T(l),  it  remains  to  validate  Eq.(4.7)  subject  to  (4.8). 

Using  (4.16)  with  the  definition  of  Marcum’s  Q-function  Q  (a ,  b  )= 

00 

f  x  exp 
b 

threshold  t(l)  in  (4.6)  such  that  a(l ,  q£)  =  c*0  while  for  the  pair  (/0,  q )  the  threshold 
remains  t(l0),  the  various  false-alarm  and  detection  probabilities  are  as  summarized  in 
Table  (4.1). 


/3(l,q)  =  Q\a  =AVn/V(l,q),b  =  Vt  {l)/V{l ,  q  )] 

{l,  9o) 

expH(0/2U(/,  9o*)l 

a  =  Ay/n/V(l,  fo ),  b  =  bo 

■ 

exp[-t(/0)/2V(/0,  9o)] 

a  —  A  yjn  /V{10,  ?o  )  =  ao,  b  =  V~2  loS  ao  =  bo 

(^o>  q ) 

exp[-t(l0)/2V(l0,  ?)] 

a  =  A  y/n/v(l 0,  q)  =  a0y/V(l0,  qS)/V(lQ,  q) 

Table  4.1-  False  alarm  and  detection  probabilities  for  various  test-p.d.f.  pairs. 


[ — -(x  2  +  a2)]  IQ(ax  )dx ,  and  recalling  that  when  l  7^  /0  we  are  free  to  adjust  the 
& 


Note  that  the  parameter  a2/ 2  defined  by 


T<  '■«> 


nA ' 


2  V(l,q) 


(4.16a) 


is  the  “effective  integrated  SNR,”  by  analogy  with  the  UMP  detector  for  the  purely 
Gaussian  noise  where  the  integrated  SNR  is  given  by  the  same  expression  with  V  — *■  a2. 


From  the  first  and  second  row  of  the  table,  by  monotonicity  of  the  Q-function  with 
respect  to  the  first  argument  (a  )  when  the  second  is  fixed,  and  since  from  Lemma  2 
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sup  (3(1,  q o)  =  (3(1 o,  go  ) 

/  e  ♦ 

This  proves  the  left  hand  side  of  (4.7). 

From  the  first  column  of  the  second  and  third  row  of  the  table,  since  the  exponent 

function  is  monotonic  increasing  when  V  increases,  by  virtue  of  the  other  inequality  in 

Lemma  2,  we  get  sup  a(/0,  q )  =  a(l 0,  q q).  It  remains  to  find  under  what  conditions 
i  ef>o 

@{Iq,  q )  =  Q(a0C  ,b0C)  >  /3(l0,  <z0*),  where  C  (q ,  q£)A  y/V(lQ,  ToiWih,  71  >  1  for 
all  q  E  P0.  Note  that  the  Q  function  is  monotonic  in  either  of  its  arguments  only  when 
the  second  is  fixed,  but  here  they  both  change  proportionally  and  the  inequality  is 
analytically  intractable.  Since  numerical  computation  of  the  Q  function  is  notoriously 
difficult,  we  give  only  a  sufficient  condition  by  solving  for  a0  with  a  fixed  b0  those  values 
which  satisfy  d/3(l0,  q)/dC  >  0,  C  >  1.  Using  known  properties  of  the  partial 
derivatives  [15,  Appendix  F[: 

exP[  ~^~\I \{xy  )  -  ~ -yexp[— x  }I0(xy  )  (4.17) 

we  obtain  an  equivalent  relation  which  is  sufficient  for  the  right  hand  side  of  Eq.  (4.7) 

TTTT^  >  -  .  V  C  >  1  (4.18) 

fo(aobo&  )  ao 

From  the  definition  of  the  modified  Bessel  functions,  I\/Iq  is  monotonic  increasing 
from  0  to  1  when  the  argument  increases  from  0  to  infinity,  so  it  suffices  to  solve  (4.18) 
for  C  =  1.  By  continuity  and  monotonicity  of  both  sides  of  Eq.(4.18),  a  solution  exists, 
and  upon  fixing  the  false  alarm  probability  (60  =  y/-2  log  <a0)  and  by  virtue  of  the 
monotonicity  of  the  Q  -function  with  respect  to  a0,  the  solution  of  Eq.  (4.18)  with  equal¬ 
ity  defines  a  critical  value  /3C  such  that  when  (3(1 0,  q  o)  =  Q{a  o,  b  0)  >  (3c , 

inf  (3(1  o,  q  )  =  (3(Iq,  ?o  )  ^  Pc  ■  This  completes  the  proof  of  proposition  4.1 
?  €  P0 

Some  numerical  values  of  (3$  are  given  in  Table  4.2.  Recall  that  they  are  found 
from  a  sufficient  condition  so  that  they  are  a  conservative  estimate  for  the  range  of 


I 
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maximin  optimality;  we  see  that  for  typical  false  alarm  levels  0C  ~  0.60. 


<*n 

10"1 

10“2 

10"3 

10"4 

10"5 

10~6 

-*■  0 

ao 

2.41 

3.20 

3.86 

4.41 

4.92 

6.16 

— +  b  „ 

0n  —  Q  kn>  bn 

0.68 

0.64 

0.62 

0.60 

0.60 

0.59 

HRXKXfl 

Table  4.2  -  Critical  values  of  /3{l0,  qo  )  for  which  the 
SSQME  test  is  maximin  robust,  non-random  amplitude. 

We  note  that  the  test  in  [6]  for  a  completely  known  signal  is  asymptotically  maxi¬ 
min  robust  only  for  0C  =  0.5  (regardless  of  a0),  a  fact  that  was  not  directly  stated 
there.  Closer  examination  of  the  situation  in  [6]  reveals  that  0C  =  0.5  is  due  to  the 
symmetry  of  the  normal  p.d.f.  of  the  test  statistics  around  its  mean  under  both  H0  and 
H v  This  is  not  the  case  for  the  SSQME  test,  and  results  in  a  higher  value.  Neverthe¬ 
less,  when  designing  a  radar,  sonar  or  communication  system,  the  desired  goal  is  in  high 
quality  detection.  Consequently,  systems  are  specified  to  achieve  /?=0.8-0.95,  and  the 
technical  parameters  which  result  in  the  required  SNR  are  designed  accordingly.  There¬ 
fore,  obtaining  the  robustness  of  the  SSQME  test  only  for  a  limited  range  of  0  is  of  no 
practical  limitation. 

From  the  expression  for  0(1 0,  q q  )  in  Table  4.1,  it  is  clear  that  the  maximin  bound 
is  only  slightly  inferior  to  the  detection  probability  in  the  uncontaminated  case,  where  0 
is  given  by  the  same  equation  but  with  a  =  A  x/rT  /<t.  This  is  so  since  for  small  e  , 
rv*(e)  *V(l0,  qo)/cr2  «  1,  as  can  be  seen  in  Table  4.3.  (See  also  [3,  p.  87]). 


e 

0 

.001 

.002 

.005 

.01 

.02 

.05 

.1 

.15 

.2 

.25 

.4 

rv  *(e) 

1 

1.01 

1.02 

1.04 

1.07 

1.12 

1.26 

1.49 

1.75 

2.05 

2.40 

4.0 

Table4.3-  Upper  bound  on  the  estimation  variance  (and  upper  bound  on  the 
loss  in  the  effective  integrated  SNR),  rv*(e)  A  V(l0,  qo)/cr“. 

SSQME  test  for  narrowband  signal  with  non-random 
amplitude  in  nearly  Gaussian  noise. 
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The  corresponding  maximin  bound  on  the  detection  probability  is  depicted  in  Fig. 
4.2  for  various  values  of  oq,  along  with  the  results  of  the  robust  test  on  the  envelope  of 
section  3  (marked  RET).  All  the  curves  are  computed  with  fixed  input  SNR  =  A2 /2a'2, 
such  that  the  detection  probability  at  the  nominal  Gaussian  p.d.f.  for  all  a0’s  is  0.9. 
Very  similar  behavior  was  obtained  for  other  SNRs.  A  dramatic  improvement  in  perfor¬ 
mance  compared  to  the  RET  is  clearly  observed.  While  the  effectiveness  of  the  RET  is 
limited  to  contaminations  which  are  smaller  than  roughly  twice  the  desired  false-alarm 
probability,  (note  that  RET  curves  with  a0  <  10~4  are  outside  of  the  range  of  the  figure) 
the  SSQME  test  can  protect  against  e  which  is  of  orders  of  magnitude  higher  than  a0, 
before  the  detection  probability  decreases  substantially,  roughly  up  to  e  =0.1  -  0.2. 
(The  discontinuous  portion  of  the  curves  indicate  that  the  test  may  not  be  maximin 
there  in  accordance  with  Table  4.2.) 
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Figure  4.2-  Maximin  bound  on  the  detection  probability  vs.  e,  narrowband 
signal  with  non-random  amplitude  and  unknown  phase  in  nearly  Gaussian  noise. 
Full  lines:  SSQME  test  (discontinuous  lines  indicate  that  the  test 
may  not  be  maximin).  Dotted  lines:  RET. 


c 
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4.3  Random  Signal  Amplitude 

In  radar-sonar  applications,  modeling  targets  with  non-random  amplitude  is  an 
oversimplification.  ^  Since  in  nominal  Gaussian  noise  the  coherent  envelope  test  is  UMP 
regardless  of  the  signal  amplitude  statistics,  it  seems  natural  to  try  to  also  use  its  robust 
version,  the  SSQME  test,  for  random  amplitudes.  Therefore,  we  will  study  the  robust¬ 
ness  of  the  SSQME  test  by  averaging  the  asymptotic  power  of  the  deterministic  case 
with  respect  to  the  assumed  amplitude  p.d.f.  (Obviously,  the  false  alarm  probability  is 
bounded  as  before.)  Two  cases  will  be  treated. 

a)  Swerling  Case  1  Target. 

This  is  the  most  common  radar  target,  composed  of  many  independent  scatterers. 
The  amplitude  is  Rayleigh,  f  (A)  —  2A  exp(-A  2/A^)/A^,  for  A  >  0,  A 7  A E  (A 2).  It 
turns  out  that  the  performance  of  the  SSQME  test  is  somewhat  less  desirable  than  that 
for  the  deterministic  signal  case. 

Proposition  4.2  Under  the  conditions  of  proposition  4.1  and  a  Rayleigh  distributed  sig¬ 
nal  amplitude,  the  SSQME  test  with  the  nonlinearity  /0  of  Eq.(4.4)  satisfies 
fflo,9><Wo,9o)  V  ?  GPo  subject  to  Eq.(4.8)  -  i.e.,  go  is  the  least-  favorable  p.d.f. 
under  H0  but  the  most  -  favorable  under  Hx.  However,  when  the  nominal  noise  p.d.f.  in 
P0  is  Gaussian  and  /0  is  the  soft  limiter  of  Eq.(4.9),  the  right  side  inequality  of  Eq.(4.7) 
can  be  replaced  by 

Wo,  q)>C  0(1  o,  ? o')  ,  o  <  C  <  1  (4.19) 

?  €  P0 

such  that  the  constant  C  (which  generally  depends  on  e,  a0  and  /3(l 0,  q q))  approaches 
unity  as  e— *-0,  a0— > >0  and  as  /3(/0,g o  )— H- 

Proof  Upon  averaging  with  respect  to  A  ,  the  power  of  the  SSQME  test  is  easily  found 
to  be 

^  Except  when  the  target  is  a  perfect  reflecting  sphere. 
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/?(/ ,  q  )  =  exp 


-t(/)/2V(/f  g) 
1  +  S(l,q) 


=  ao1/!1  +  5(/-  ?)1 


where  S(l ,  q)  A  nA^/2V(l ,  q).  Thus  suj^  0(1 ,  q£ )  =  0(lo,  q£)  since 


(4.20) 


log 


Wj_q  o) 
0(loi  7o  ) 


=  logo0 


2[V(l,qZ)-V(l0>  g0*)] 

[1  +  S-l(l,qZ)\  [l  +  S-Vctfo')] 


<0  ,  v  /  e  *  (4.21) 


by  virtue  of  the  right  side  inequality  in  (4.12).  Unfortunately,  the  right  side  inequality 
of  (4.7)  is  never  satisfied  here: 


(3(1 0,  q)  =  <Xo/[T+S{i°’(l°)]  ,  r  A  -  J-y  -  <  1  V  7  €  P0  (4-22) 

v(lo,  7o ) 

Thus  0(lQ,  q q),  which  is  given  by  (4.22)  with  r  =  1,  is  always  higher.  This  is  actually 
the  second  case  that  was  discussed  in  section  1.3. 


However,  the  SSQME  test  will  practically  be  *qalmost”  optimal  robust  (in  the  sense 
that  for  V  7  6  Po  the  performance  is  bounded  in  a  narrow  neighborhood: 

I  /?(/„,  7o  )— /?( ^ o> 7 )  I  <  ao,  A  )  — ►  0,  according  to  the  discussion  preceding  Fig.  1.2 
of  section  1.3),  if  the  right  hand  side  of  the  maximin  relation  (4.7)  could  be  replaced  by 
Eq.(4.19)  such  that  the  constant  C  (which  generally  depends  on  e,  a0  and  0(lo,  (?o))  Is 
very  close  to  unity.  From  Eq.(4.22),  C  is  obtained  for  the  p.d.f.  for  which  r  achieves  its 
smallest  possible  value  over  P0.  Some  algebra  yields 


log  C  = 


1 

1  —  r  min(€) 


log  orp 

log  0(1  o,  7  0  ) 


log  0(1  o,  7o  ) 


(4.23) 


It  is  clear  from  this  expression  that  c  — ►  1  monotonically  as  a0— *-0  and  as  rmjn— *-1. 
Some  basic  calculus  shows  that  this  convergence  is  also  true  when  /?(/0,7o  )~ +1-  At  this 
stage  we  restrict  the  class  P0  to  the  nominal  Gaussian  case.  Using  Eqs.  (4.9)  and  (4.11) 

K 

K2Pq  {  |  x  |  >  K  }  +  f  x2q(x)dx 

-K 
K 

[f  q(x  )dx  ]2 
-K 


V(l o,  7) 

?  =  (1  -  £)?0  + 


(4.24) 
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where  <70  is  the  normal  p.d.f.  and  K(e)  is  given  in  (4.10).  It  is  not  difficult  to  see  that 
(4.24)  and  hence  also  r  are  uniquely  minimized  when  h  is  a  point  mass  at  the  origin,  as 
each  of  the  terms  in  the  numerator  achieves  its  minimal  value  and  the  denominator  its 
maximal  value;  using  Eq.(4.24)  with  (4.10)  yields 


r  min(^) 


inf 

=  (1  -  d?o  +  ch 


VVo,  g) 

V(l0,  ?o) 


hence  r  min  — *■  1  as  e  — ►  0. 


fj) 

[1  +  e  V (/o,  ?o  )]2 


(4.25) 


t 

.001 

.01 

— 

.1 

.2 

.3 

.5 

m !■ 

2.63 

1.945 

_ Ui _ 

.685 

.436 

an 

- -  min^) 

.9910 

TTTM 

.349 

.177 

.028 

0.2 

.995 

.97 

.78 

.62 

.52 

.44 

in 

1 

.999 

.994 

.96 

.93 

.91 

.89 

1-2  TO-5 

.999 

.998 

.998 

.998 

0.2 

.997 

.98 

.89 

.81 

.72 

10  4 

(3*  =  0.5 

1-5  10^ 

.997 

.98 

.97 

.95 

0.9 

1-1  10^ 

1-7  10"5 

1-5  TO-4 

.999 

.999 

.999 

0.2 

.998 

.99 

.93 

.88 

.84 

.81 

106 

0*  =  0.5 

EsSESI 

.998 

.986 

.98 

.97 

.96 

0.9 

Kim 

1-5  10'5 

ffnTTriB 

■i — aria 

1-7  TO'4 

0.2 

.9987 

.991 

.95 

91 

.88 

.86 

10~8 

0*  =  0.5 

1-2- 1(T* 

.99 

.98 

.98 

.97 

0.9 

1-5  TO-6 

1-2- 10^ 

1-4  TO-4 

■ffirrM 

■Etnmf 

Table  4.4  -  The  constant  C  such  that  /?(/0,  q)  >  C  q o)> 

V  q  €  Po-  SSQME  test  for  Sw.l  target 
in  nearly  Gaussian  noise. 
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The  computation  of  (4.23)  with  (4.25)  is  summarized  in  Table  4.4.  We  observe  that 
for  the  typical  small  a0’s  and  large  /?0’s  desired  in  radar-sonar  systems,  C  is  almost 
indistinguishable  from  1,  even  for  very  large  contamination  e.  Thus,  for  any  q  6  Po  and 
for  the  the  range  of  practical  interest,  the  asymptotic  power  is  almost  equal  to  the  maxi- 
min  bound,  and  the  SSQME  test  is  “almost  robust”  as  conjectured.  As  in  the  deter¬ 
ministic  amplitude  case,  the  performance  of  the  SSQME  test  is  characterized  by  the 
identical  decrease  in  the  effective  SNR  5(/0,  qQ)/(nA^/2a2),  which  is  very  small  for 
small  e.  Thus  the  qualitative  behavior  as  in  Fig.  4.2  is  repeated  here,  with  dramatic 
(asymptotic)  improvement  over  the  test  of  section  3. 

Further  thought  reveals  that  this  “almost  robustness”  is  a  consequence  of  always 
using  the  same  fixed  threshold.  If  it  were  also  possible  to  precisely  estimate  the  variance 
of  the  test  statistic  under  H0  and  to  adjust  the  threshold  accordingly: 
t  (q  *  )=-21na0  V{q^  )— *t  [q  )=-2lna0  V{q  )=-21no:o^(?  )>  then  we  would  obtain 

fi(q  )=Q!Q1+5^0’f  ^  >f3{q  * )  V<?  €  Po-  Since  in  practical  applications  almost  always  the 
scale  (variance)  of  the  nominal  noise  p.d.f.  is  also  unknown,  adaptive  thresholding  will 
be  unavoidable  in  conjunction  with  the  robust  structure  to  maintain  CFAR.  This  will 
be  treated  in  more  detail  in  Section  5.2 

Comparing  the  known  lowpass  signal  [6]  and  the  non-random  amplitude  cases  with 
the  Rayleigh  amplitude  case,  some  thought  suggests  that  the  power  fails  to  satisfy  the 
desired  maximin  relation  exactly,  due  to  an  absence  of  a  mode  in  the  p.d.f.  of  the  test 
statistic  T(Iq )  in  the  present  case.  To  verify  this,  we  also  consider  the  following  case 
which  does  possess  a  mode,  and  represents  some  physical  targets. 

b)  Swerling  Case  3  Target. 

This  model  corresponds  to  one  dominant  reflector  plus  Rayleigh;  the  amplitude 
p.d.f.  is  /  (A )  =  8A3exp(-2A2/X2)/A~22  for  A  >0  [17],  In  this  case,  the  desired 

behavior  of  the  deterministic  amplitude  case  is  possible. 
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Proposition  4.3  Proposition  4.1  is  valid  for  a  Swerling  3  target  model. 

Proof  The  power  of  the  SSQME  test  is  found,  in  a  manner  similar  to  [13],  by  averaging 
the  non-random  amplitude  result  with  respect  to  the  above  amplitude  p.d.f.,  and  is  given 

by 


0(1,  q)  = 


1  + 


t(l)/2V(l,  q) 


(1  +  S(l  ,q  )/2)  (1  +  2/5(/ ,  <]f ) 


exp 


-t(l)/2V(l,q) 
l  +  S(l,q)/2 


(4.26) 


log  Op  S(l,  q)/ 2 
(l  +  5(/,g)/2)2 


l/[l+5(/,  ?)/2] 


from  which  sup  0(1 ,  q o  )  =  0(1 0,  q$)  follows  by  virtue  of  4.12  with  some  algebra.  Also, 
I  6* 


0(h,q)  = 


1  _  log  a0S(l0,  q o)/2 
(r  +  S(l0,  qo)/2f 


l/[r+S(lv  ? o)/2) 
a0 


(4.27) 


where  r  =  U(/0,  q )/  ^(^o>  <7o )  5:  1  35  before.  A  sufficient  condition  for 
0(lo,  q)  >  /9(/0,  ,0*)  Vr  <  1  is  obtained  from  solving  d0(lo,  q)/ dr  <0,  V  r  <  1. 
Upon  differentiating  (4.27)  a  quadratic  inequality  in  S(l0,q o)  is  obtained,  whose  single 
proper  solution  is  given  by 


<?I7„  >  Sr 


/,  9 


It  is  easy  to  see  that  Eq.(4.26)  is  monotonic  increasing  when  S(l0,  q q)  increases,  hence 
the  maximin  relation  (4.7)  is  fully  satisfied  whenever  0(lo,  q$)  >  0C  where  0C  is 
obtained  upon  substituting  Sc  in  (4.26) 

The  critical  values  are  given  in  Table  4.5;  they  are  somewhat  higher  than  those  of 
the  deterministic  amplitude  case,  but  still  in  the  range  of  practical  interest.  Since  the 
one-dominant-plus-Rayleigh  p.d.f.  corresponds  to  Chi-squared  p.d.f.  of  A2  with  2  degrees 
of  freedom,  while  the  non-random  amplitude  can  be  regarded  as  the  limit  of  the  Chi- 
squared  family  when  the  number  of  degrees  of  freedom  N  tends  to  infinity,  it  is  conjec¬ 
tured  that  whenever  the  amplitude  squared  belongs  to  this  family  but  N  7^  1  (Ray- 
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leigh),  there  exists  a  f3c  such  that  when  j3(l0,  qo)~>  Pc  the  SSQME  test  is  asymptoti¬ 
cally  maximin  robust,  and  pc  decreases  when  N  increases.  It  seems,  however,  that  each 
case  needs  a  separate  derivation  to  prove  this  conjecture. 


«0 

10_1 

10"2 

10'3 

10^ 

10“5 

10"6 

io-8 

5.36 

9.70 

14.1 

18.7 

23.2 

27.8 

36.9 

Pc 

0.78 

0.75 

0.743 

0.740 

0.738 

0.738 

Table  4.5  -  Critical  values  of  P(Iq,  q o)  for  which  the 
SSQME  test  is  maximin  robust,  Swerling  3  target. 

4.4  Relationship  with  Weak  Signal  LORD 

For  a  given  observation  sequence,  the  M-estimates  defined  by  Eq.  (4.2)  are  fixed 
numbers;  they  are  expected  to  be  close  to  zero  under  H0  and  also  under  H i  when  A  is 
small,  due  to  the  robust  properties  of  the  estimator.  Thus,  an  approximation  is  obtained 
by  expanding  the  summands  of  (4.2)  in  a  Taylor  series  in  powers  of  A  ,  and  then  solving 
(4.2)  using  only  the  two  leading  terms.  This  yields 

.  A'(x<)  i  - 

A  SJi - si  s  /(*,•)  (4.29) 

<=i 

The  second  approximation  is  justified  as  follows:  For  nominal  Gaussian  p.d.f.  with 
the  optimal  robust  nonlinearity  /0,  the  denominator  just  equals  the  number  of  £;  which 
are  in  [-if  (e),  if  (e)];  it  is  quite  close  to  n  when  e  is  small  (since  if(e)  is  then  larger  than 
the  variance  of  the  samples),  and  only  weakly  changes  for  different  realizations  of  x. 

All  of  the  locally-optimal  robust  detectors  that  have  been  studied  in  [4],  [5],  [9],  [10] 
and  [12]  are  based  on  this  test  statistic,  and  thus  they  all  hide  an  “approximate  robust 
estimator  of  zero.”  It  is  worthwhile  to  note  that  the  weak  result  of  [4]  and  [5],  namely, 
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the  LORD  satisfies  the  maximin  relation  Eq.  (1.10)  on  the  slope  of  the  power  function  at 
A  =  0  only  for  a  >  am,n(e)  is  actually  a  consequence  of  this  approximation.  (This 
lower  bound  should  be  noted  is  much  larger  than  false-alarm  levels  of  any  practical 

N  E?(A) 

interest  ).  Though  the  incremental  SNR  =  lim  - — ; —  is  identical  with  (4.16a), 

A  -  o  VAR  {A) 

the  numerator  and  denominator  are  somewhat  different  from  those  stated  in  Lemma  1 


E1(A)\a_o  =  A  Eq(l‘  )+0 (A2);  VAR{A)\A=Q=±-Eq{l2)  (4.30) 

n 

Thus  there  are  actually  two  related  optimization  problems  (for  o;  and  j3  ),  which  are  not 
parametrized  by  the  same  functional  V(l,q)  as  in  the  SSQME  test;  this  lead  to  a 
different  condition  that  must  be  satisfied  for  Eq.(l.lO)  to  hold  -  see  [4],  [5]. 

The  same  approach  could  be  taken  for  our  problem.  The  structure  of  Eqs.  (4.3)- 
(4.6)  is  preserved,  but  the  M-estimators  are  replaced  with  A  of  (4.29).  The  locally- 
optimal  robustness  can  be  verified  using  similar  techniques  as  those  in  [4j.  Under  the 
regularity  conditions  stated  in  [4],  which  require  more  than  the  conditions  in  (4.1),  the 
central  limit  theorem  is  satisfied  and  the  detection  probability  is  again  Marcum’s  Q- 

function  where  the  parameters  {a,  6}  are  modified  according  to  (4.30).  Using  the 
3 

expression  for  -  Q  (a,  b  )  in  Eq.  (4.17),  along  with  /^O)  =  0,  If  (0)  7^  0.  it  can  be 

da 

shown  that  the  right  side  inequality  of  Eq.  (1.10)  is  satisfied,  and  thus  the  detector  is  LO 
robust,  for  a  limited  range  in  the  false  alarm  probability  given  by 

«o  >  «min(e)  =  min  [d(?)e(?)]~2/|e(?)_l1  (4.31) 

?  e  P0 

where 


and 


<*(?)  = 


Eq{l o'  ) 

V'o'  ) 


d{q)  >  1  V  q  €  P0 


(4.32) 


e(<?)  = 


V'o2) 

Eq(ii) 


e{q)  >  1  \i  g  €  Po 


(4.33) 
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where  l0  —  -(In  q  q)'  is  the  locally-optimal  nonlinearity.  The  inequality  in  (4.32)  is 
valid  since  lQ'  is  zero  over  the  interval  where  the  least-favorable  p.d.f.  go  of  Eq.  (4.13) 
puts  all  of  its  contamination;  the  one  in  (4.33)  is  valid  due  to  the  last  mentioned  fact  on 
go  and  to  the  additional  regularity  requirement  that  l0  is  monotonic  nondecreasing. 
Because  of  this,  the  false  alarm  bound  of  Eq.  (1.3)  is  obtained  also  here  for  V  ot  £  (0,  1), 
but  not  by  the  same  derivations  as  for  the  SSQME  test.  Utilizing  these  inequalities,  it  is 
easily  verified  that  amin(e)  €  (0,  1).  However,  we  have  not  solved  (4.31)  to  see  if  the 
actual  values  are  small  enough  to  be  acceptable,  as  we  believe  that  the  SSQME  test  is  a 
preferred  solution;  it  is  neither  restricted  by  any  a:mjn(e)  nor  limited  to  weak  signals.  This 
conclusion  has  been  verified  by  simulation  results,  to  be  presented  in  Section  5.4 
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5.  ROBUST  DETECTORS  FOR  REALISTIC  ENVIRONMENT 

The  derivation  of  maximin  robustness  of  the  SSQME  test  in  section  4  was  based  on 
several  major  assumptions,  all  of  which  have  been  required  for  previous  studies  of  robust 
detection  [4]- [12].  The  following  were  assumed  as  completely  known:  a)  the  percentage 
of  contamination  e,  b)  the  nominal  p.d.f.  in  the  class  P0;  specifically,  the  variance  was 
assumed  known  for  the  nearly  Gaussian  case,  c)  the  frequency  of  the  signal  to  be 
detected.  Such  knowledge  concerning  the  detection  environment  is  usually  absent  in 
any  realistic  application  where  robust  procedures  are  needed.  This  is  particularly  true 
under  the  detection  environment  of  radar  (sonar)  systems  where  the  clutter  (reverbera¬ 
tion)  processes  rapidly  change  in  time  and  space.  This  section  extends  the  utility  of  the 
SSQME  test  for  such  situations,  and  ends  with  a  finite-sample  Monte-Carlo  analysis. 

5.1  Sensitivity  to  Unknown  e 

In  all  previous  works  [4]- [12]  the  amount  of  contamination  e  was  assumed  known. 
Clearly,  this  might  not  hold  for  a  nonstationary  and  nonhomogeneous  detection  environ¬ 
ment.  Theoretically,  when  any  knowledge  of  e  is  lacking,  the  mixture  family  is  no  longer 
convex  and  is  not  well  defined  to  allow  an  optimum  robust  test.  In  practice  though,  it 
might  be  possible  to  bound  e<emax  based  on  physical  considerations.  (If  emax=l>  the 
robust  estimators  of  the  SSQME  test  reduce  to  median  estimators). 

An  obvious  but  apparently  unstated  observation  is  that  any  density  of  the  form 
/  ( x  ;e)  of  Eq.  (2.3a)  belongs  to  the  mixture  family  Po(emax)  */ /  e57max: 

(l_e)/o+e/(l  =  (1_emax)/  0+emax[(l - ““)/  <7  “ — h  i  =  '  '  ‘  (5-l) 

^max  ^max 

•••  =  (l-emax)/o+emax^'  ,  (/ 0,M  '  )^M 

and  the  expression  in  square  brackets  h'  is  a  legitimate  density  because  it  is  positive 

(l-e/emax>0)  and  integrates  to  1. 

Table  5.1  shows  the  asymptotic  variances  of  the  M-estimators  of  Eq.(4.2)  when  the 
input  noise  density  is  a  Gauss-Gauss  mixture  and  c 2  is  the  variance  of  the 
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contaminating  density.  They  were  computed  from  Eq.(4.1l)  with  the  soft-  limiter  of 
(4.9-4.10).  In  each  sub  table  the  limiter  break-point  k  is  designed  according  to  emaX)  and 
the  variances  are  given  for  e<emax.  Also  indicated  in  the  headings  are  the  upper  bounds 
on  the  estimation  variance  computed  with  the  least-favorable  density.  It  is  seen  (and  can 
be  shown  analytically  using  the  properties  of  the  Gaussian  c.d.f.  when  solving  (4.11)  in 
conjunction  with  (4.14))  that  the  variance  monotonically  increases  with  e  and  c  to  the 
minimax  bound.  A  similar  behavior  occurs  with  the  detection  probability  since  it  is  a 
monotone  function  of  the  effective  SNR  Eq.(4.16a),  which  is  inversely  proportional  to  the 
estimation  variance. 

The  main  conclusion  from  the  table  is  that  even  when  the  test  is  designed  for  large 
emax,  the  variance  is  much  smaller  when  actually  e«emax.  An  even  more  important  con¬ 
clusion  is  drawn  from  Table  5.2,  which  shows  a  very  pessimistic  case:  the  test  is  designed 
with  k(emax  =  0.5),  and  the  entries  in  the  table  are  the  ratio  of  variances  between  those 
obtained  with  this  design,  and  those  that  could  be  obtained  if  e  was  actually  known  and 
k(e)  was  used-  Var  (k  (emlu<.),e)/Var  ( Ar(e),e).  It  is  clearly  seen  that  even  for  e=0.001  the 
efficiency  loss  is  merely  about  28%.  This  is  translated  to  an  increase  of  only  1.07  dB  in 
the  SNR  required  to  achieve  the  same  P4.  When  emax  can  be  more  tightly  bounded,  the 
difference  is  even  smaller. 

Similar  behavior  is  found  when  the  contaminating  density  is  the  longer  tailed 
Laplace  density,  as  can  be  seen  by  dividing  the  entries  of  Table  5.3a  by  the  correspond¬ 
ing  ones  in  5. 3b. (Here  however,  the  variance  is  not  monotone  with  e  for  small  c ). 

To  summarize  this  discussion,  we  have:  A  design  with  emax  preserves  the  maximin 
robustness  properties  for  \/e<emax,  and  the  loss  incurred  from  not  knowing  the  actual  e 
is  shown  numerically  to  be  reasonably  small  for  the  Gauss-Gauss  and  Gauss-Laplace 


mixture  families  considered. 
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Table  5.1 


Estimation  variances  with 


fixed  limiter  k(emax)  for  e<e 


Gauss-Gauss  mixture. 


EPSMAX=0. 005  K(EPSMAX)=  2.160  VMAX(K)=1 . 037 


F=(l 

-EPS)*N(0/ 

1)+EPS*N(0, 

C**2) 

EPS/C 

1 

3 

10 

30 

100 

0.0010 

1.0066 

1.0097 

1.0116 

1.0123 

1.0125 

0.0020 

1.0066 

1.0127 

1.0167 

1.0180 

1.0184 

0.0030 

1.0066 

1.0158 

1.0218 

1.0237 

1.0243 

0 . 0040 

1.0066 

1.0189 

1.0269 

1.0294 

1.0303 

o.ooso 

1.0066 

1.0220 

1.0320 

1.0351 

1.0362 

it  ******* 

***************************************************** 

EPSMAX=0 . 

010  K ( EPSMAX ) =  1.945 

VMAX(K) 

=1.065 

F=<1 

-EPS) *N( 0, 

1 ) +EPS*N( 0 , 

0*2) 

EPS/C 

1 

3 

10 

30 

100 

************************************************************* 

0.0020 

1.0121 

1.0177 

1.0210 

1.0220 

1.0224 

0 . 0040 

1.0121 

1.0233 

1.0300 

1.0321 

1.0328 

0 . 0060 

1.0121 

1.0289 

1.0391 

1 . 0422 

1.0432 

0.0080 

1.0121 

1.0346 

1 . 0482 

1.0523 

1.0538 

0.0100 

1.0121 

1 . 0403 

1.0573 

1.0626 

1 . 0644 

************************************************************* 

EPSMAX=0 . 

050  K( EPSMAX )=  1.399 

VMAX(K) 

=1.256 

F=(l 

-EPS ) *N( 0, 

1)+EPS*N(0, 

C**2 ) 

EPS/C 

1 

3 

10 

30 

100 

************************************************************* 

0.0100 

1.0467 

1.0695 

1.0807 

1.0840 

1.0852 

0.0200 

1.0467 

1.0928 

1.1157 

1.1226 

1.1250 

0.0300 

1.0467 

1.1165 

1.1518 

1.1625 

1.1662 

0 . 0400 

1.0467 

1.1408 

1.1890 

1.2037 

1.2088 

0.0500 

1.0467 

1.1655 

1.2274 

************ 

1.2464 

1.2530 

************* 

max* 


Table  5.1  (continued) 
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TaUe  5. 1  (continued) 


EPSMAX=0 . 100  K(EPSMAX)=  1.140  VMNC(K)=1 . 490 


r-<i 

-EPS)«N<0, 

,  1)*EPS*N(0,C**2) 

EPS/C 

l 

3 

10 

30 

100 

0.0200 

1.0812 

1.1239 

1.1433 

1.1490 

1.1509 

0.0400 

1.0812 

1.1685 

1.2092 

1.2214 

1.2254 

0.0600 

1.0812 

1.2149 

1.2793 

1.2989 

1.3053 

0.0800 

1.0812 

1.2634 

1.3539 

1.3817 

1.3909 

0.1000 

1.0812 

1.3139 

1.4333 

1.4706 

1.4829 

EPSWUC«0. 

200  K(EPSMAX)*  0.862 

VMMC<KJ=2.046 

F-<1 

-EPS)*N(0, 

1)»EPS*N(0,C*»2) 

EPS/C 

1 

3 

10 

30 

100 

0.0400 

1.1393 

1.2221 

1.2574 

1.2678 

1.2706 

0.0800 

1.1393 

1.3121 

1.3904 

1 . 4142 

1 . 4208 

0.1200 

1.1393 

1 . 4i02 

1.5411 

1.5818 

1.5933 

0.1600 

1.1393 

1.5175 

1.7125 

1.7748 

1.7928 

0.2000 

1.1393 

1.6350 

1 . 9083 

1 . 9982 

2.0249 

EPSMAX-O. 

300  K(EPSMAX<-  0.685 

VMAX(K) 

=2.822 

F-U 

-EPS)»N(0, 

1)«-EPS*N(0.C**2) 

EP3/C 

1 

3 

10 

30 

100 

0.0600 

1.1916 

1.3162 

1.3684 

1.3838 

1.3870 

0.1200 

1.1916 

1.4580 

1 . 5802 

1.6177 

1.6260 

0.1800 

1.1916 

1 . 6200 

1.8367 

1.9058 

1.9220 

0.2400 

1.1916 

1 . 8061 

2.1508 

2.2655 

2.2941 

0.3000 

1 . 1916 

2.0214 

2.5405 

2.7219 

2.7698 

EPSMAXsO. 

400  K(EFSMAX)°  0.555 

VMAX(K)' 

=3 . 996 

F»<1 

-EPS ) *N( 0 , 

1 ) ♦EPS*N( 0, C**2 ) 

EPS/C 

1 

3 

10 

30 

100 

0 . 0800 

1.2395 

1.4087 

1.4794 

1.5007 

1.5061 

0.1600 

1.2395 

1.6100 

1.7855 

1.8409 

1.8557 

0.2400 

1.2395 

1.8518 

2.1838 

2.2945 

2.3256 

0.3200 

1.2395 

2.1455 

2.7140 

2.9160 

2.9761 

0.4000 

1.2395 

2 . 5064 

3.4395 

3 . 7973 

3 . 9099 

EPSMAX*0. 

500  K(EPSMAX)«  0.436 

VMAX(K) 

=5 . 928 

F-<1 

-EPS)*N<0, 

,  1  )*EPS*N(0,i 

0**2 ) 

EPS/C 

1 

3 

10 

30 

100 

0.1000 

1.2916 

1.5086 

1 . 5999 

1.6267 

1 . 6294 

0.2000 

1.2916 

1.7789 

2.0195 

2.0949 

2.1047 

0.3000 

1.2916 

2.1211 

2 . 6094 

2.7747 

2.8021 

0.4000 

1.2916 

2.5626 

3.4734 

3.6124 

3 . 8808 

0.5000 

1.2916 

3.1*47 

4.8072 

5.5038 

5.6726 

i 
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Table  5.2  ratio  of  estimation  variances  between  designs  with  fixed  k(emax=0.5) 
and  variable  k  (e), Gauss-Gauss  mixture. 


VAR(K(EPSMAX) )/VAR(K(EPS) )  EPSMAX=0.500  K(E  PSMAX)=  0.436 


F=( 1-EPS) *N(0 , 1) +EPS*N( 0, C**2 ) 


EPS/C  1  3  10  30  100 

******************************************** ***************** 


0.0010 

1.2897 

1.2868 

0.0050 

1.2832 

1.2734 

0.0100 

1.2762 

1.2605 

0.0500 

1.2340 

1.1963 

0. 1000 

1.1946 

1.1482 

0.2000 

1.1337 

1 . 0880 

0.3000 

1.0839 

1.0493 

0 . 4000 

1 . 0420 

1.0224 

0 . 5000 

1.0000 

1.0000 

1.2838 

1.2828 

1.2824 

1.2645 

1.2617 

1.2604 

1.2471 

1.2428 

1.2409 

1.1687 

1.1600 

1.1547 

1.1162 

1.1062 

1.0987 

1.0582 

1 . 0484 

1.0394 

1.0271 

1.0194 

1.0117 

1.0099 

1 . 0040 

1.0031 

1 . 0000 

1 . 0000 

1.0000 

Table  5.3  Estimation  variances  for  Gauss-Lapiace  mixture,  a)  Fixed  k  (emax). 


b)Variable  k(e) 


EPSMAX=0. 500  K( EPSMAX )=  0.436  VMAX( K )=5 . 928 


EPS/C 

F=(1 

-EPS)#N(0, 

1  )+EPS#LAPLACE( 0, C##2 ) 

1 

3 

10 

30 

100 

0. 1000 

1 . 1863 

1.4343 

1.5708 

1.6172 

1.6379 

0.2000 

1.0922 

1.5996 

1.9403 

2.0676 

2. 1252 

0.3000 

1.0076 

1.7924 

2.4425 

2.7139 

2.8392 

0.4000 

0.9314 

2.0189 

3.1478 

3.6848 

3.9411 

0.5000 

*##*###•* 

0.8626 

2.2874 

4. 1795 

5.2347 

5.7638 

VARIANCES  WITH 

OPTIMAL  K( EPS) 

F=(1 

-EPS )*N( 0, 

1 )+EPS*LAPLACE( 0, C**2  ) 

EPS/C  1  3  10  30  100 

######  #####4MMMMMMMHMMMMHMMHMMMMMMHMMHMMMMMMMMHMMMMMMHMMMMMMMfr 


0. 1000 

1.0438 

1.2455 

1.3979 

1.4570 

1.4799 

0.2000 

1.0318 

1.4711 

1.8222 

1.9647 

2.0244 

0.3000 

0.9914 

1.7138 

2.3675 

2.6536 

2.7837 

0 . 4000 

0.9327 

1.9821 

3.1101 

3.6591 

3.9318 

0.5000 

0.8626 

2.2874 

4.1795 

5.2347 

5.7638 
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5.2  Extension  of  the  SSQME  Test  for  Unknown  Scale  of  the  Nominal 
Density 


The  scale  a  of  the  nominal  density  in  the  e-mixture  family  affects  the  implementa¬ 
tion  and  performance  of  the  SSQME  test  in  two  ways.  First,  a  must  be  known  for  con¬ 
struction  of  the  optimal  robust  M-estimates,  as  the  break  points  of  the  nonlinearity  are 
functions  of  it,  Eq.(4.14);  e.g.,  they  are  equal  to  ±crk(e)  for  the  nominal  Gaussian  case. 
Second,  the  detection  threshold  setting  <(/0)  must  be  proportional  to  V(l0,qQ  )  to  obtain 
the  desired  a0  level,  (see  Table  4.1),  and  V  itself  is  proportional  to  cr2.  Even  if  the 
optimal  M-estimators  are  replaced  by  some  other  robust  and  scale  invariant  estimators 
of  the  quadrature  locations,  the  second  crucial  problem  must  still  be  addressed. 


In  the  specific  case  of  nominal  Gaussian  density,  which  is  the  main  interest  of  our 
work,  the  first  problem  is  solved  together  with  a  substantial  simplification  in  implemen¬ 
tation  complexity  by  utilizing  Tukey’s  alpha-trimmed  mean  estimator  instead  of  the  M- 
estimator.  This  estimate  is  defined  by: 


1 


n  (l-2a) 


n  -l-[n  a] 

E  *(i)  +  (H-[»  Qf]-»  Of)[*(1+|«  aj  x (n-\n  a|)l 
» =2+[n  a | 


(5.2) 


where  !(,■)  are  the  ordered  samples:  ®(1)<i(2)< . x ),  and  [n  aj  is  the  greatest  integer 

in  n  a  for  0<a:<0.5.  The  estimator  simply  deletes  [nor]  samples  from  each  end  and  then 
takes  the  weighted  mean  of  the  remaining.  Its  advantage  over  Huber’s  M-estimator  is 
that  solving  an  implicit  nonlinear  equation  is  not  necessary.  With  the  above  weighting  it 
is  translation  invariant  for  all  n  (and  unbiased  when  Ef  (i,  )  exists).  Bickel  [33]  showed 
that  it  is  asymptotically  Gaussian  and  its  variance  for  symmetric  f  is 


Var  (yfri  X a) 


(l-2o)2 


f  x2f  [x)dx  +ax  i_a 


(5.3) 


where  F  (xi_a)= 1-a.  The  two  estimators  are  quite  similar  for  proper  choice  of  a  and 
k  (e);  they  both  sum  linearly  the  main  bulk  of  the  samples,  but  Huber’s  estimator  treat 


-  57  - 


large  samples  and  outliers  by  limiting  them,  while  the  a-trimmed  estimator  censors  them 
completely.  The  main  difference  is  that  for  small  n  the  a  -trimmed  will  always  discard 
some  of  the  observations,  even  if  they  all  are  small  and  come  from  the  nominal,  while 
the  other  might  not.  For  n  — ►oo,  however,  they  are  equivalent  in  quadratic  mean.  The 
variance  of  Huber’s  estimator  with  the  soft-  limiter  nonlinearity  l0(x  ,-k  ,k  )  is 

k 


2  [jx2f(x)dx  +k2F(-k)\ 


V(l0,f  )  = 


(5.4) 


[1-2  F(-k)]2 

This  is  identical  to  (5.3)  when  one  choses  a(e)=F  (-k  (e)).  Hence  they  both  have  the 
same  asymptotic  distribution  and  minimax  property  so  that  the  scale  invariant  a- 
trimmed  mean  can  be  substituted  in  the  SSQME  test  without  change  in  the  derivation. 

Two  different  asymptotically  scale  invariant  extensions  of  the  SSQME  test  are  next 
discussed.  The  difference  between  them  depends  on  the  availability  of  a  “noise-reference” 
channel. 


a)  “Sliding- Window”  SSQME  test 

We  follow  the  definitions  and  the  physical  setup  of  section  2.2.  In  addition  to  the 
“test-cell”  samples  {x,y},  there  are  also  available  M  “noise-  reference”  vectors  of  obser¬ 
vations:  {uij,Vij},i  =1  ,...n  ,j  =1  u  j  £{uij}in=hj  =1  ,...M  are  statistically 

independent  vectors  that  are  identically  distributed  as  x  is  under  H0,  and  likewise  for 
Vj  and  y.  An  adaptive  threshold  test  is  constructed  by  using  the  SSQME  structure  in 
both  the  “test-cell”  statistic  R  (x,y)  and  in  the  threshold  estimator  W(u,v)  of  Eqs.(2.8)- 
(2.11).  Specifically,  the  test  is 

DECIDE  Hl  iff  Ra(x,  y)  >  t{M)Wa(  u.v)  (5.5) 

where  R  a(x,y)  is  identical  with  Tn(l0)  of  Eq.(4.3)  with  the  M-estimators  Ax  and  Ay 

replaced  by  the  a-  trimmed  means  of  the  x  and  y  samples,  respectively.  In  a  similar 

M  _  _  _ 

manner,  W a(u,v)—(l/M)  Wja,  where  Wj  a=  U fa  +  V fa  J  Uj  a  aRd  Vj  a  are  the  a- 

f= l 
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trimmed  means  of  u y  and  Vy,  respectively.  The  test  structure  is  shown  in  Fig.  5.1. 


Fig.  5.1  Sliding- Window  scale  invariant  SSQME  robust  test 


TEST- CELL 
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Under  H0,  for  n  — ►  oo  the  Wj\fj  and  R  are  not  only  identically  distributed  (this 
is  true  also  for  finite  n),  but  the  unknown  input  scale  appears  only  through  V(l0,q), 
which  is  again  a  scale  parameter  of  the  (one  sided  exponential)  distributions  of  these 
statistics.  Hence,  the  test  (5.5)  is  scale  invariant  and  CFAR:  a0  =  Prob  [R  >  t  W }  V 

q€Po  ■  It  is  important  to  emphasize  that  here,  unlike  the  fixed  threshold  SSQME  test, 
the  multiplier  t(M)  is  not  a  function  of  V(l0,q q),  so  that  the  same  o0  is  obtained  for  V 

'<7€Po-  Moreover,  the  expected  value  of  the  statistic  Wj  is  exactly  equal  to  the  variance 
V(l0,q )  of  the  estimators  in  the  “test-cell”  by  definition,  and,  in  addition,  when 
M  — +  oo  the  estimation  of  this  variance  by  the  “sliding-window”  sample  mean  VF(u,v) 
converges  almost  surely  to  the  true  value.  Hence,  the  test  (5.5)  will  be  asymptotically 
(M— *oo)  a.s.  equivalent  to  the  SSQME  test  when  a2  of  the  nominal  density  is  known. 
For  finite  M  the  false  alarm  probability  is  given  by  a0=(l+t(M)/M)~M  ,  and  the 
detection  probability  of  a  Rayleigh  amplitude  signal  by  (compare  [18]- [20]  for  the  pure 
Gaussian  case) 

0(lo,q)=  [l+t(M)/M(l+S(l0,q)]-M  (5.6) 

Here  S(l0,q)=  nA$  /2<?2V(i0,q)  is  the  effective  SNR  computed  at  the  optimal  non¬ 
linearity  (or  its  corresponding  trimming  ratio  a)  for  density  q  1.  Note  that  the  limit  of 
these  expressions  with  M —+  oo  are  indeed  the  exponential  probabilities  which  are  valid 
for  the  SSQME  test  as  in  section  4.3  case  a),  and  that  for  M  >16  the  detectability  loss 
incurred  by  the  adaptive  threshold  scheme  is  very  small,  c.f.  [18]-[20]. 

The  monotonity  of  f3(l0,q)  in  5(/0,g)  ensures  as  before  that  the  upper  bound  on 

the  variance  in  qE Po  is  translated  to  a  lower  bound  on  the  detection  probability  .  Note 

that  the  problem  with  the  Rayleigh  signal  of  section  4.3  case  a)  is  solved  here,  as  t{M)  is 

not  a  function  of  V(l0lq  g)  as  it  was  there.  Hence  /3(l0,q  )>0{lo,qo )  V  E  (A2)>0\ 

Moreover,  this  conclusion  remains  valid  for  the  various  amplitude  models  discussed  in 

1The  expression  for  deterministic  signal  amplitude  can  be  derived  as  in  [18]  with  the 
effective  SNR  as  given  by  (4.16a) 
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section  4,  and  in  general,  for  any  amplitude  p.d.f.  that  leads  to  a  monotone  increasing 
/ S(S  |  W)  in  S  for  the  known  scale  case.  (Since  0(S)=Ew0(S  |  W)  is  obviously  mono¬ 
tonic). 

We  can  summarize  this  discussion  in  the  following  proposition: 

Proposition  5.1  Let  Dj^  be  the  class  of  all  hypothesis  tests  of  the  form 
R(TX,  Ty)>  ty  W{  ru^ ,  ,j  —  1,  ‘  '  '  M)  where  W  and  R  are  as  in  Eq.  (5.5),  with 
the  a-trimmed  estimators  replaced  by  any  other  translation  invariant  statistics  T ,  and 
let  d  (a)  be  the  test  with  a-trimmed  estimators.  Let  Dm  be  the  class  of  all  deci¬ 

sion  rules  based  on  { Tx,Ty }  -  i.e. ,  without  imposing  the  structure  of  Eq.  (5.5)  and  pre¬ 
cluding  reference  samples.  Then  -- 

a)  The  test  is  CFAR  for  any  gGP^  and  Pj a=(l+t(M)/M)'M . 

b)  If  a* (e)=Fo(~k(e)),  where  Fq  is  the  c.d.f.  of  q q,  q$  and  d*(a*(e))  are  a  saddle- 
point  pair  in  (P0(T,Dm)  for  the  detection  probability 

0  (qt.d)  <  P(qZ,d*)  <  0  (q  ,d* )  (5.7) 

d€DM  ?£Po» 

i.e.,  the  test  is  asymptotically  maximin  robust.  Moreover,  as  e-+0,  0{q^  ,d*)—*  mono- 

tonely  to  the  detection  probability  of  the  best  scale  invariant  test  against  pure  Gaussian 

noise.  Thus,  small  deviations  from  the  assumed  nominal  model  only  slightly  degrade  the 

performance.  0{q^,d*)  is  given  by  Eq.  (5.6)  evaluated  at  q$  . 

c)  When  also  M—*oo,d*  is  equivalent  to  the  Neyman-Pearson  test  in  the  wider  class 

for  9q(<7=I)  -  i.e.,  assuming  that  <r  is  known  and  only  {Tx,Ty}  are  available  as  observ¬ 
ables. 

For  the  general  case  of  a  non-  Gaussian  nominal  density  /  0,  the  a-  trimmed  mean 
is  not  the  optimal  estimator,  although  it  is  still  robust  in  Hampel’s  sense  as  will  be  dis¬ 
cussed  in  the  following.  All  the  previous  conclusions  remain  valid,  if  the  a  -trimmed 
estimators  are  replaced  by  the  optimal  L-estimators  for  the  same  least  favorable  p.d.f. 
of  Eq.  (4.13).  The  location  invariant  and  scale  equivariant  L-estimators  are  given  by 
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Ln(x)=  am- £(,-),  etc.,  where  the  weights  are  generated  from  a  score  function  m(s) 
according  to 


i  jn 

«».•  =  f  m(s)ds 

(«-!)/" 

and  m(-)  is  related  to  q q  of  Eq.(4.13)  through 


(5.8) 


where 


_J _ 

/(F0*)  rfx2 

0 


(-log/0(i;l)) 


|  x  |  <  Ij 

otherwise 


(5.9) 


/  0(i1;l)/A:-F0(-a;1;l)=e/2(l-e)  and  |  /  0'  (x^l)//  0(jc  iJl)  |  =k  .  Notice  that  also  here 
the  censoring  percentage  is  The  minimax  relation  on  the  estimator’s  variance 

remain  valid  (where  V(q^  )=l//(<?o  ))>  ^ue  to  the  asymptotic  equivalence  between  L 
and  M-estimators  [Jaeckel,  46],  provided  the  ani  are  chosen  as  above.  (Stigler  [47]  proved 
later  that  if  the  nominal  p.d.f  is  any  symmetric,  twice  differentiable  and  strongly  unimo- 
dal  p.d.f.  on  R1,  the  L-estimator  corresponding  the  above  score  function  is  indeed 
asymptotically  Gaussian  with  the  variance  assumed  in  [46]). 

To  summarize,  the  SW  adaptive  threshold  extension  of  the  SSQME  test  is  a  suit¬ 
able  choice  for  radar-sonar  systems  where  “noise-reference”  samples  are  conveniently 
available.  It  possesses  all  the  asymptotic  desired  maximin  properties;  its  implementation 
in  the  nominal  Gaussian  case  is  relatively  easy  based  on  a-trimmed  estimators  which  do 
not  require  more  than  ordering  of  the  data,  an  operation  that  is  common  to  most  non- 
parametric  tests  and  is  becoming  available  for  real  time  implementation  with  VLSI  tech¬ 
nology. 


b)SSQME  test  with  a  preliminary  scale  estimate 

We  assume  now  that  {x,y}  are  the  only  available  observables,  and  the  nuisance 
scale  estimate  must  be  derived  from  the  “signal  plus  noise  ”  data  itself.  We  draw  and 
build  on  Huber’s  theory  [3]  of  simultaneous  estimation  of  location  and  scale.  For 
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completeness  of  the  presentation,  we  next  quote  and  make  some  interpretations  on 
Huber’s  main  results: 

i)  Simultaneous  M-estimates  for  location  and  scale  are  a  pair  of  statistics  (Tn  ,Sn) 
determined  by  solving  equations  of  the  form 


n  r-  -T 

E*(-4r^-)  =  0 


«=1  $n 


n  x-  -T 

Ex(-V^-)  =  o 


i=l 


(5.10a) 


(5.10b) 


These  are  similar  to  the  ML  estimates  of  9  and  cr  for  a  family  of  densities 

— /  ( - ),  with  proper  selection  of  #  and  y.  As  most  common  test  statistics  and 

a  a 

estimators  depend  on  the  samples  only  through  the  empirical  distribution  function 
Fn  (x  ;  x  )  =  —  E^{i,  <z  }>  where  I  {a}  is  the  indicator  function  of  the  set  A,  it  is 


convenient  to  express  T„  (x)  =  T(Fn)  and  Sn  (x)  =  S(Fn)  in  terms  of  the  func¬ 
tionals  defined  by: 


=  o 

!x[^P-)F(dx)  =  0 


(5.11a) 

(5.11b) 


Hampel’s  [34]  influence  curve  (IC)  is  a  very  useful  heuristic  tool  of  robust  statistics, 
which  describes  the  (suitably  normed)  limiting  influence  of  a  single  observation  on 
the  estimator.  Intuitively,  a  qualitative  robust  estimator  must  have  a  bounded  IC. 
If  \fn  [T(F„)  -  T(F)]  is  asymptotically  zero  mean  Gaussian,  its  variance  is  given 
by  (using  (5.13)  below)- 


V(F,T) 


MsTFT}F(dx) 

f!C2(x  ;F  ,T)F (dx  )  —  52(F)-f - - 


(5.12) 


f*  (WJ)f(dx[ 

with  a  similar  expression  for  V(F  ,S).  If  (and  only  if)  F  and  x  are  symmetric,  and 
#  is  skew-symmetric,  the  IC’s  of  the  simultaneous  estimates  are  uncoupled;  in  par- 
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ticular,  only  S(F)  but  neither  the  IC  nor  the  asymptotic  variance  of  S  enters  into 
the  expression  of  IC(T).  The  IC’s  are  given  by: 


IC(x;F,T)  = 


IC  (x  \F  ,S )  = 


*(■ 


S(F) 


)S{F ) 


J>  ( 

X( 


S(F) 


)F(dx ) 


S[F) 


)S(F) 


fx'  ( 


— — ) — - —  F(dx) 
S(F)  S(F)  K  1 


(5.13) 


(5.14) 


Note  that  for  a  given  distribution,  T(F)  and  S(F)  are  numbers;  Tn  and  Sn  are 
Fisher  consistent  estimates  if  T(F)  and  S(F)  are  equal  to  the  estimated  parameters. 
Under  the  above  mentioned  symmetry  conditions,  the  estimates  are  asymptotically 
uncorrelated  and  hence  independent. 


ii)  The  asymptotic  robustness  (in  Hampel’s  sense  )  is  clear  from  (5. 13)-(5. 14),  provided 
that  'I'  and  x  are  bounded  functions.  In  section  6.4  of  [3]  it  is  shown  that  under 
relatively  mild  conditions  the  coupled  estimates  are  consistent 

P 

(Tn  ,Sn)  — ►  (T  ( F),S  (F))  and  jointly  Gaussian. 


iii)  The  IC  of  T  for  the  case  when  the  nominal  scale  is  known  is  given  by  the  same 
expression  as  (5.13)  but  with  S(F)=1.  If  S(F)  does  not  change  too  much  when 
FGPo  (a  property  which  must  be  satisfied  for  any  reasonably  consistent  and  robust 
estimate  of  the  scale),  the  desired  properties  of  the  location  estimate  with  known 
scale  will  be  roughly  preserved.  However,  it  is  not  possible  to  obtain  an  exact 
minimax,  since  the  loss  of  the  translation  invariance  symmetry  in  the  multi¬ 
parameter  case  does  not  enable  extension  of  the  parametrization  of  the  nominal 
model  throughout  a  convex  uncertainty  neighborhood,  c.f.  [3,  section  11.1],  It 
should  also  be  clear  that  the  variational  techniques  of  [3]  are  not  applicable  for 
finding  the  density  that  minimizes  V(F  ,T),  even  when  the  asymptotic  variance  of 
S  is  considered  as  a  nuisance  factor,  due  to  the  deep  and  implicit  nonlinear  coupling 
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of  Eq.(5.12)  subject  to  Eqs.(5.1l). 

iv)  The  simultaneous  solution  of  equations  (5.10)  is  perhaps  an  unnecessary  compli¬ 
cated.  Simplified  variants  are  the  one  step  M -estimates.  They  are  obtained  by 
starting  with  some  preliminary  estimates  Tn ^  and  S^°\  and  then  solving  Eq.(5.11a) 
approximately  by  applying  Newton’s  method  just  once  .  The  following  variant 
emerged  in  the  extensive  “Princeton  Monte-Carlo  study  ”  of  Andrews  et.al.  [35]  as 
very  robust,  even  for  small  sample  sizes.  The  preliminary  estimates  are  the  median: 
T^°\x)  —  med{xi},  and  the  median  of  the  absolute  deviations  from  it  (MAD): 
5j°\x)  =  med  {  |  x j-Tj°\x)  |  }/0.6745,  where  0.6745=4>_1(3/4)  to  get  consistency 
at  the  Gaussian  (x(i)— |  x  |  -1),  thus  5^°\F)=F_1(3/4)  for  asymmetric  den¬ 
sity  from  (5.11b)).  The  MAD  is  the  limiting  case  (e— *T)  of  the  minimax  robust  scale 
estimate,  and  its  computation  is  simpler.  Then, 


1  X;  —  Tj®  ,  ,nl 

1  *  n  (0) 


Tnm(x)  =  r,(0)(x)  + 


-)S," 


i  x-r  w 

1  5  (°)  ’ 


(5.15) 


Tn^  is  asymptotically  (n  — »oo)  equivalent  to  the  full  solution  of  Eq.(5.10a)  T^00^, 
provided  the  previously  mentioned  symmetry  conditions  are  satisfied.  Moreover, 
with  these  symmetry  conditions,  any  one-step  estimator  with  translation  invariant 
and  odd  Trj°'1  :  TJ-0\x+c)=TJ-0\x)-\-c  ,T^°\-x)=-T^°\xj),  will  have  the  same  IC 
(and  hence  identical  asymptotic  properties)  as  the  full  solution  of  the  coupled  equa¬ 
tions.  Note  that  for  the  nominal  Gaussian  case  the  computation  of  (5.15)  is  of  the 
same  order  of  complexity  as  of  the  nonlinear  detectors  typical  of  the  LORD 
approach  (section  1.3),  provided  Tn ^  and  Sn^  are  available  after  two  orderings  of 
the  data  (the  denominator  is  just  the  number  of  normalized  samples  in  -k  ,/r  ]). 


With  this  background  ,  it  seems  natural  to  preserve  the  structure  of  the  SSQME 
test,  but  with  scale  invariant  location  estimators  and  an  adaptive  threshold  which  is 
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derived  from  the  scale  estimate.  Thus  we  propose  the  following  test: 

DECIDE  Hx  iff  R(x, y)  >  tn  W(x,y)  (5.16) 

where 

S(x,y)  £  !r,(1|(x)|2+[r„<1*(y)]2  ,  W{x,y)  £  [S„<0>(x)]2+[S„<l»(y)|2  (5.17) 

T^11  is  the  one  step  M-estimator  of  Eq.{5.15)  and  S„*0)  is  the  MAD  scale  estimator.  The 
structure  of  the  test  is  shown  in  Fig.  5.2. 

Fig.  5.2  SSQME  test  with  MAD  preliminary  scale  estimate. 
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The  asymptotic  properties  of  the  test  are  summarized  in  the  following.  They  are 
valid  for  all  simultaneous  and  one  step  M-estimators  provided  the  symmetry  properties 
that  were  mentioned  before  hold. 

Proposition  5.2  a)  The  test  (5.16)  is  CFAR  for  any  fixed  density  /  £P0  with  unk¬ 
nown  (variable)  a.  b)  With  variable  f  ,  the  changes  in  the  asymptotic  false-alarm  proba¬ 
bility  are  governed  only  by  the  normalized  variance  of  the  location  estimates  : 
V'  (F  ,T^)  ~  V(F  ,T^)/(S^0\F  ))2  ,  but  not  by  the  variance  of  the  scale  estimate,  c) 
The  asymptotic  detection  probability  is  given  by  the  same  expression  as  for  a  fixed 
threshold  SSQME  test:  ft(F  )=a(F  )1/(1+5Wfi  (^))  (  for  a  Rayleigh  signal),  where 

tiA  ^ 

SNR(F)  =  - ^ - -  (5.18) 

2 [S^\F)}2V  (F,T(1)) 

is  also  not  a  function  of  the  scale  estimator  variance. 

Proof  1)  Both  r„W  and  Sn ^  are  scale  invariant:  5^0|(crx)— aS„(0\x),  etc.,  from 
which  a)  follows  immediately.  Also,  S^F^)  =  cr5'^0\F(r=1),  hence  we  will  omit  the  scale 
subscript  on  F  for  convenience. 

00  00 

2)  The  probability  of  acceptance  of  H x  is  given  by  J  f  w(w  )dw  J  f  R{r)dr  ,  since  R  and 

o  („u- 

W  are  asymptotically  independent  when  the  symmetry  conditions  of  i)  above  are  met.  R 
is  one  sided  exponentially  distributed,  with 

E0{R)  =  !(S(°)(F))2F'  (F,T{1))  ,  E^R)  =  E0{R)[1+SNR]  (5.19) 

71 

by  straightforward  calculation.  Note  that  the  definition  of  SNR  is  justified  (by  analogy 
to  the  classical  case)  as  V'  (F  ,T^)  is  also  a  invariant  by  its  definition,  as  long  as  the 
scale  and  location  estimates  are  also  invariant.  Hence  the  input  noise  variance  affects 
SNR  only  through  S^F (T)=aF<'°\F  j).  As  W  is  the  sum  of  the  squares  of  two  indepen¬ 
dent  identically  distributed  Gaussian  r.v.’s  N(a,v‘2),  it  has  a  noncentral  distribution 
with  2  degrees  of  freedom  and  noncentrality  parameter  \=2 a2,  c.f.  [13].  Explicitly, 
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,  ,  s  1  I  w+2a2 , ,  ,  a  \/2w  ,  ,  nc., 

exp( - - - )/0 - )  5.20 

2v  2v  v 

where  I0  is  the  modified  Bessel  function  of  zeroth  order.  Here  a=S,^(/r)  and 

v=—  V(F,S^)  under  H0  and  under  Hlt  as  Sj°\x+c  )=STj°\x).  The  desired  probabili- 
n 

ties  are  thus  computed  from: 

p.  =  exp(-»  /v }  ]Io{±j!L)iw  ,  i=Ha,Hx(. 5.21) 

The  evaluation  of  this  integral  is  facilitated  by- 

OO 

fexp(-x)I0(2\/yx  )dx  =  exp (y)  (5.22) 

o 

which  can  be  derived  by  a  change  of  variables  from  the  normalization  of  the  Rician  den¬ 
sity.  The  result  is 

«p[-^“(g)(1+2t„  v  /£.  Ill ))"' 

Pi  “  1+2 t.v/Ei(S) 

As  a2/Ei  (R)  n  and  v  /E{(R)  is  not  a  function  of  n  ,  in  order  to  get  a  non-zero  a  as 
n—*oo  we  must  have  tn—t/n.  Thus  when  n  — *■00  the  terms  that  contain  V(F  ,S^°\F )) 
through  n  cancel  out  and  lim  p0  —  exp (-t/V1  [F  ,T l1))  follows  and  proves  b).  The 

n  — ►oo 

proof  of  c)  is  similar  upon  substitution  of  Ei{R)  from  Eq.(5.19)  into  (5.23)  and  taking 
the  limit  as  n  —►00. 

Comments  and  interpretation 

l)Since  the  error  probabilities  are  not  functions  of  the  variance  of  the  scale  estimator, 
should  be  chosen  to  achieve  better  and  flatter  consistency  over  P„ ,  and  not  accord¬ 
ing  to  its  variability.  2)  The  false  alarm  probability  with  the  preliminary  estimate  is 
parametrized  by  V'  (F,T^),  which  is  roughly  equal  to  the  variance  under  the  known 
scale  case,  if  5*°\ir')=l  and  it  does  not  change  too  much  over  P0.  Hence  we  have  a  simi¬ 
lar  situation  as  in  Chapter  4,  and  the  upper  bound  on  the  false  alarm  probability  is 
approximately  satisfied  if  the  least  favorable  density  is  chosen.  While  it  does  not 
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seem  possible  to  give  an  analytic  characterization  of  the  density  that  maximizes 
V'  (F  ,T^)  over  the  e-mixture  family  P0  as  was  discussed  in  iii)  above,  if  the  noise  den¬ 
sity  is  restricted  to  a  narrower  family  (e.g.,  Gauss-Gauss  or  Gauss-Laplace  e-  mixtures), 
the  supremum  of  the  variance  can  be  found  (at  least  numerically)  and  adjusted  accord¬ 
ingly  to  obtain  an  upper  bound  on  the  a  over  the  restricted  family.  3)With  that  res¬ 
tricted  maximization,  an  approximate  upper  bound  on  the  detection  probability  will  also 


be  obtained  as  follows.  Denote  by  P0p  the  restricted  noise  family.  Since 
S^°\F  ,D  )=F'\3/4)/D  (for  the  MAD  estimator)  changes  over  P0p,  its  arbitrary  normal¬ 
ization  D  can  be  used  as  an  optimization  parameter.  Let  V**(D)=  MaxV'  (F,T^\D) 

P()r 

and  c*0=exp(-£  /  V**(D  ))>ar(F  ,D  ).  Under  the  same  approximation  of  section  4.3  for 


sufficiently  high  desired  probability  of  detection  (/?;>0.5),  with  substitution  of  SNR  from 
Eq.  (5.18): 


RF,D)  =  a{F  ,D  )lH1+SNR(F))  S  ag 


2| S(°\F,D  )] iV”(D)/nA* 


(5.24) 


Hence  the  additional  detectability  loss  from  scale  estimation  relative  to  the  worst  case 
where  o  is  known  is  characterized  by  the  effective  SNR  loss 


L{F,D)  = 


V(l0,q*) 


(5.25) 


and  D  **  can  be  chosen  to  minimize  the  maximum  loss  over  P0r 

D  **  —  ArgMin{  Max  L  (F  ,D  )}.  Hence,  a  (/3)  is  bounded  from  above  (below)  over  the 
D/ePor 

family,  and  the  additional  loss  due  to  scale  estimation  is  low  if  L  (F**  ,D  **  )~1. 


Figures  5.3a-5.3f  show  the  results  of  such  a  numerical  optimization  over  the  Gauss- 
Gauss  mixture  family,  vs.  the  r.m.s.  power  of  the  contamination  c  .  In  each  figure  the 
dashed  lines  are  of  V  (F  ,T^)/V** ,  and  the  continuous  lines  are  of  L  [F  ,D  ).  One  set 
of  graphs  in  each  figure  corresponds  to  the  optimization  with  respect  to  D  ,  while  the 
other  was  obtained  where  D  was  taken  to  get  consistency  at  the  least-favorable  density 
of  the  known  a  case,  i.e.,  D*  =F*<  i,(3/4).  Using  Eq.(4.10),  it  is  given  by 
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D  * 


k  — Lin 
k 


4(1— e)/  0(A: ) 


fo(k)  >  1 


<F 


-l 


1  +2(1-6) 
4(1  -  e) 


k  4(l-e) 

fo(k) 


(5.26) 


<- 


1 


k  4(l-e) 

The  figures  demonstrate  that  there  is  not  any  additional  loss  (above  that  incurred  from 
F(/0,?o  )/0'2)  from  fche  scale  estimation  for  e<0.2,  that  the  loss  is  negligible  (about  4%) 
for  €=0.3,  and  it  is  quite  small  (about  26%, corresponding  to  1  dB)  for  e=0.4.  Even  for 
the  huge  uncertainty  €=0.5  the  loss  is  reasonable  (50%)  if  c  <20.  For  e>0.5,  it  seems 
that  L(F  ,D  **)  is  unbounded.  This  agrees  with  the  ’’breakdown”  point  of  the  joint  esti¬ 
mation,  as  evaluated  by  Huber  [3,  pp.  141-146].  Also,  the  loss  difference  between  the 
optimal  D  **  and  the  normalization  D  *  is  quite  small.  As  D  **  >0.6745,  the  efficiency  at 
the  nominal  Gaussian  is  somewhat  sacrificed  to  obtain  better  performance  over  P0r 
(compare  with  Table  5.1).  Similar  conclusions  can  be  drawn  from  Figures  5.4a-5.4d 
which  are  valid  for  the  Gauss-Laplace  mixture  family.  When  the  nominal  p.d.f.  is  Gaus¬ 
sian,  the  test  of  Eq.(5.17)  can  be  simplified  by  utilizing  a-trimmed  estimators  in  place  of 
the  one  step  location  estimator.  In  that  case,  it  turns  out  that  the  agreement  with  the 
performance  of  the  known  scale  is  even  closer,  see  [48i. 
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;  a)  e=0.01 


.  b)  esO.l 


Fig.  5.3  Normalized  variance  V1  (F  ,T^)/V**  (dashed  lines)  and  effective  SNR  loss 
L  (F  ,D )  of  SSQME  test  with  a  preliminary  scale  estimate.  Gauss-  Gauss  mixture, 
c  2=contamination  variance.  D  *  -  scale  normalization  of  the  known-cr  least  favorable 
density,  D  **-  optimal  scale  normalization,  a)  e=0.01  . 
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O  a)  e=0.1  _  b)  e=0.2 

O  O 


Fig.  5.4  Normalized  variance  V'  (F  ,T^)/V**  (dashed  lines)  and  effective  SNR  loss 
L  [F ,D  )  of  SSQME  test  with  a  preliminary  scale  estimate.  Gauss-  Laplace  mixture, 
c  2=contamination  variance.  D  ‘  -  scale  normalization  of  the  known-cr  least  favorable 
density,  D  ** -  optimal  scale  normalization. 
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Finally,  we  note  that  the  undesired  result  where  the  false  alarm  probability  still 
depends  on  /  £P0  (unlike  the  SW-SSQME  version)  is  a  consequence  of  estimating  the 
scale  of  the  input  samples  and  utilizing  it  for  the  threshold,  instead  of  direct  estimation 
of  the  variance  of  the  location  estimator.  (Recall  that  in  the  classical  normal  case  the 
variance  of  the  sample  mean  estimator  differs  only  by  a  factor  of  n  from  the  variance  of 
the  input  samples).  This  might  be  resolved  with  additional  implementation  complexity, 
by  taking  the  adaptive  threshold  to  be  proportional  toU2(x)-f-  U2(y),  where 


f/2(x)  =  —  —  y  ^  F  (x)  T  (1)  ) 

l  )  n  n£i  5'°)(x)  '  "  j 


(5.27) 


and  the  IC  is  given  by  Eq.  (5.13).  Under  the  previously  stated  regularity  and  symmetry 
conditions  this  estimate  of  V [F  ,T^)  is  consistent.  Thus  it  is  reasonable  to  assume  that 
V  (F  ,T^)  will  cancel  out  from  the  error  probabilities.  However,  the  error  probabilities 


with  this  estimate  are  analytically  intractable  even  when  n  — *-oo  (see  also  [3],  section  6.8). 


We  feel  that  this  additional  complication  is  probably  unnecessary,  as  is  evident 
from  Figs.  5. 3-5. 4.  Moreover,  Monte-Carlo  finite  sample  simulation  results  indicate  that 
when  the  test  of  Eq.(5.16)  is  implemented  with  the  breakpoints  corresponding  to  the 
density  q$,  both  error  probabilities  did  not  deviate  in  any  noticeable  way  from  the 
known  a  case  (i.e.,  when  (5.12)  is  computed  with  5(F)=1),  when  e<0.2  and  n  >50. 
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5.3  Unknown  Signal  Frequency  and  Implementation  Complexity 

In  radar-sonar  detection,  the  frequency  of  the  signal  is  often  unknown,  since  it  is 
proportional  to  the  target  radial  velocity  by  the  Doppler  shift  effect.  In  the  Gaussian 
noise  case,  the  problem  is  commonly  solved  by  employing  a  bank  of  M  contiguous 
coherent  envelope  tests  ,  each  of  the  form  of  Eq.(2.4),  where  the  itk  is  matched  (by 
baseband  conversion)  to  a  Doppler  shift  of  i  fr/M  (fr  — l/T  =pulse  repetition  fre¬ 
quency).  If  the  Doppler  shift  is  spread  uniformly  over  [0 ,/r],  almost  optimal  performance 
over  this  range  is  obtained  by  implementing  the  contiguous  bank  with  a  weighted  FFT 
processor  of  length  n  ,  as  the  envelope  of  the  FFT  filters  outputs  is  almost  flat.1  This  will 
serve  in  the  following  as  a  basis  for  comparison  of  implementation  complexity  of  the 
various  robust  tests. 

In  a  similar  manner,  robust  tests  for  unknown  frequency  can  be  constructed  as  a 
bank  of  M  contiguous  SSQME  tests.  Specifically,  in  the  klft  channel,  k  =0,1, ...M-l,  the 
input  observables  of  Eqs.(2.5-2.6)  are  transformed  according  to 

Ii(k)=Re  [(*,•  +JVi  )exp(-j  2nik  /M )]  ,  Q({k  )=/m  [(*,-  +pm  )exp(-;'  2irik  /M )] 

and  then  M  parallel  SSQME  tests  are  performed  on  {I(fr  ),Q(k )},  see  Fig.  5.4a.  In  this 
way,  for  the  M  discrete  frequencies  kfr/M  in  the  uncertainty  range  [0,/r],  the  perfor¬ 
mance  will  be  optimal  as  before,  with  some  loss  for  straddling  frequencies.  The  question 
now  arises  if  this  loss  is  comparable  to  that  of  the  FFT  processor  when  M—  n  .  At  first 
glance  we  might  suspect  that  it  would  be  larger  due  to  the  nonlinear  processing,  but  it 
turns  to  be  identical  at  least  for  large  sample  sizes. 

Propositions. 3  Let  the  frequency  shift  of  the  signal  relative  to  the  down  conversion 
mixers  be  f  d  and  let  n -*■  oo.  Then,  the  detection  probability  of  the  various  SSQME  tests 
is  given  by  the  same  expressions  as  for  f  d  —  0,  where  the  effective  SNR  is  attenuated 
by  G  (f  d)  which  is  identical  to  that  of  the  FFT  processor: 


flatness  of  about  1  dB  is  achieved  with  Hamming  weighting. 
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G{fi)  = 


sin2(  n  nJi/  }  T  ) 

sin  \nf  d/fr) 


(5.28) 


The  proposition  is  proved  in  appendix  D;  the  key  property  is  an  asymptotic  normality 
result  for  M-estimators  of  a  sequence  of  r.v.’s  with  unequal  means.  The  implications 
should  be  clear:  not  only  that  it  is  not  necessary  to  construct  a  bank  of  more  than  n 
contiguous  SSQME  tests,  but  it  is  also  possible  to  apply  the  same  weighting  techniques 
that  are  employed  with  the  FFT  processor  to  get  a  flatter  performance  over  the  fre¬ 
quency  range  (at  a  price  of  reduced  resolution  and  reduced  SNR  at  /rf=0).  (See  appen¬ 
dix  D  for  the  details.) 


The  major  increase  in  complexity  stems  from  the  nonlinear  processing  of  the 
SSQME  test.  Thus,  the  number  of  operations  required  for  a  single  frequency  must  be 
multiplied  by  n  resulting  in  0(n2)  for  the  full  range,  compared  to  nlogn  for  the  linear 
FFT  processor  (in  the  following,  logn=log2n).  It  does  not  seem  possible  to  perform  some 
of  the  nonlinear  operations  before  the  n  frequency  conversions. 


For  purposes  of  simplicity,  we  assume  that  multiplications,  additions  and  comparis¬ 
ons  are  equally  costly.  Hence,  the  FFT  processor  requires  O  (5n  logn  +4n )  real  operations. 
The  SSQME  test  for  the  nominally  Gaussian  noise  is  based  on  a-trimmed  means.  If  ord¬ 
ering  the  data  is  done  by  the  QUICKSORT  algorithm  [39],  the  expected  number  of 
operations  for  an  i.i.d.  sequence  is  O (nlogn)  2.  Hence  a -  trimming  requires 
O(nlogn+  c  (a),l<c  (a)<n.  With  it  available,  the  MAD  scale  estimator  requires 
O  (n  logn +2n )  operations  in  addition,  and  the  1-step  estimator  for  the  non-Gaussian 
nominal  density  needs  O  (lln )  more,  assuming  that  the  evaluation  of  a  nonlinear  func¬ 
tion  is  done  by  interpolation  between  two  stored  values.  With  these,  the  order  of  com¬ 
plexity  for  the  various  tests  of  the  previous  section  is  easily  obtained  by  counting.  For 
the  SW  version  we  do  not  count  the  operations  in  the  adjacent  M  reference  cells,  as  the 
SW  is  performed  by  sequentially  repeating  the  algorithm  on  adjacent  spatial  cells  and  it 


2  QUICKSORT  is  a  random  algorithm,  it  can  require  up  to O  (n2)  operations. 
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is  only  required  to  keep  in  memory  the  outcomes  of  operations  that  have  already  been 
performed.  Table  5.4  summarizes  the  order  of  the  number  of  operations.  The  last  entry 
belongs  to  the  narrow- band  Wilcoxon  nonparametric  test  [40]  which  is  a  candidate  for 
comparison  with  the  robust  tests.  The  test  statistic  for  a  single  frequency  is 

RNBw(x,y)  =  [£  +  [Etf+(2/,')t%,-)]2  (5-29) 

1=1  1=1 

where  R+( x,)  is  the  rank  of  |  x,-  |  in  |  xx  |  ,  |  x2  |  |  xn  | ,  and  U(x,-)  is  the  unity  step 

function.  The  test  has  the  same  order  of  complexity  as  the  SSQME  test,  which  substan¬ 
tially  outperforms  it  as  will  be  shown  in  the  next  section. 


Type  of  Test 

Order  of  Complexity 

FFT  processor 

5n  logn  +  4n 

SW-SSQME  for  nominal  Gaussian  noise 

2n2logn  +  2n  (e  (a)+l) 

SSQME  with  preliminary  scale  estimate 

4n2(logn  +  1)  +  2n  (c  (a)+2) 

for  nominal  Gaussian  noise 

SW-SSQME  for  nominal  non-Gaussian 

4n2(logn  +  12)  +  8n 

noise 

SSQME  with  preliminary  scale  estimate 

4n2(logn  +  12)  +  lln 

for  nominal  non-Gaussian  noise 

Wilcoxon  NB 

2n2(logn  +  3)  +  4n 

Table  5.4  Order  of  complexity  for  various  tests  of  narrow-band  input  of  length  n. 


Figure  5  4a  Bank  of  n  SSQME  tests  for  unknown  Doppler  frequency 


1 
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5.4  Finite  Sample  Simulation  Results 

The  asymptotic  approach  to  robust  statistical  procedures  is  an  unavoidable  neces¬ 
sity,  as  the  n  — ►oo  assumption  allows  one  to  invoke  the  central  limit  theorem  in  order  to 
obtain  explicit  functionals  which  parametrize  the  performance  of  the  procedure.  Thus, 
optimization  by  some  criterion  becomes  possible.  Although  asymptotically  one  could 
find  true  optimal  procedures  by  estimating  the  underlying  distribution,  this  usually 
requires  a  prohibitive  number  of  samples  [49 j.  Thus,  the  central  reason  for  the  asymp¬ 
totic  formulation  is  the  hope  that  robust  estimation  and  testing  procedures  will  approach 
their  asymptotic  behavior  quickly.  The  “Princeton  Monte  Carlo  Study”  of  Andrews 
et.al.  [35]  showed  that  this  is  indeed  the  case  for  the  variance  of  M  and  other  robust 
estimators  of  location.  For  detection,  we  need  also  to  demonstrate  the  tail  behavior. 

In  most  communication  engineering  applications,  n  will  be  quite  small.  This  is  true 
particularly  for  radar  -  sonar  systems,  where  such  considerations  as  the  desired  non 
-ambiguous  detection  range,  angular  resolution  and  minimization  of  the  required  time  to 
search  the  detection  spatial  sector,  call  for  decreasing  n  .  Moreover,  we  have  seen  time 
and  again  through  this  work  that  detectability  of  coherent  signals  is  governed  (at  least 
asymptotically  )  by  the  integrated  SNR  nA  2.  In  active  systems,  this  product  is  propor¬ 
tional  to  the  average  transmitter  power,  which  usually  is  the  fundamental  constraint 
rather  than  the  peak  power  A  2.  Therefore,  at  a  given  time  period  the  same  detectability 
can  be  maintained  by  increasing  A  and  reducing  n  while  keeping  nA  2  constant,  thus 
achieving  the  other  important  systems  objectives.  As  a  result  of  that,  most  radar  sys¬ 
tems  are  designed  with  n  —O  (10-50)  or  even  smaller  for  long  range  systems.  This  simple 
observation  which  is  probably  known  to  any  practicing  engineer  has  been  totally  over¬ 
looked  in  theoretical  considerations,  and  the  usefulness  of  procedures  resulting  from 
asymptotic  analysis  remains  questionable  without  small  -  sample  performance  analysis. 

Unfortunately,  analytic  tools  for  studying  finite  sample  performance  are  not  avail¬ 
able.  Results  in  the  spirit  of  the  Berry-Essen  theorem  [37],  which  bounds  the  deviation 
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from  Gaussianity  of  the  distribution  of  the  sample  mean  estimator  are  non-existent  for 
M-estimators.  This  is  so  because  it  is  not  possible  to  obtain  an  analytic  expression  for 
the  finite  sample  moments.  (Notice  that  its  asymptotic  variance  was  derived  indirectly 
through  the  convergence  of  its  distribution).  We  must  resort  to  either  numerical  or 
simulation  studies  which  do  not  analytically  exhibit  the  rate  of  convergence. 

For  M-estimators  with  a  monotone  nonlinearity  /,  the  distribution  can  be  com¬ 
puted  from  an  n  -fold  numerical  convolution  since  Prob  {A  <a} 
—Prob  {Y^i{xi~a)< 0}.  For  our  SSQME  test  more  integrations  are  required  since 
R  =A  x2  +A  y  ,  and  A  and  <f>  have  to  be  averaged  out.  Field  and  Hampell  [42]  recently 
showed  how  to  alleviate  the  enormous  amount  of  computations  by  their  so  called  “small 
sample  asymptotics”  technique,  which  is  closely  related  to  Daniels  [43]  saddlepoint 
approximation.  Though  this  technique  was  shown  to  be  very  accurate  down  to  sample 
sizes  of  3,  even  in  the  extreme  tails,  it  is  still  numerical  in  nature  and  involves  a  substan¬ 
tial  programming  effort.  Therefore  we  have  not  proceeded  in  this  direction  but  per¬ 
formed  instead  a  Monte-Carlo  study  -  programming  is  relatively  straightforward  and 
flexible,  and  the  empirical  distribution  estimates  are  known  to  be  unbiased  and  converge 
to  the  true  distribution  with  a  large  number  of  repetitions. 

Heuristically,  the  SSQME  tests  based  on  robust  amplitude  estimation  should  con¬ 
verge  at  a  faster  rate  than  the  locally  optimal  robust  detector  (section  4.4)  based  on 
weak  signal  assumptions.  The  latter  incorporates  a  hard  “non  tracking”  nonlinearity; 
the  distribution  of  the  test  statistic  has  finite  support  (when  n  is  finite)  with  a  point 
mass  at  the  upper  end  of  the  interval.  In  contrast,  the  M-estimator  can  take  any  value 
on  R1  and  its  distribution  will  be  smoother  and  hence  converge  faster  to  Gaussian.  This 
is  most  significant  under  Hl  when  the  distribution  of  the  LORD  test  statistic  will  be 
mostly  concentrated  around  the  boundary  of  the  intervals  for  large  SNR. 

When  adopting  the  asymptotic  theory  testing  structure  for  finite  sample  size,  the 
amount  of  trimming  or  limiting  must  be  increased  if  very  small  Pfa  is  desired,  since  we 
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are  dealing  with  the  tail  of  the  distribution  where  deviations  from  theory  are  larger.  This 
is  explained  more  precisely  as  follows.  Define  Ak  =  {  number  of  outliers  in  either  of  the 
quadrature  channels  that  is  exactly  k,  and  less  than  or  equal  k  in  the  other  channel  }. 
Since  these  events  are  mutually  exclusive  and  their  union  is  the  certain  event  we  can 
write 

(5.30) 

Pfa  =Pr  {R  >  t  }=2,E  Pr  {R  >t  \Ak}P  (Ak )+  £  Pr  {R  >  t\  Ak  }P  (Ak  ) 

k  =0  £=2[na|  +  l 

Thus,  a  lower  bound  for  Pfa  is  the  second  term.  In  it,  the  number  of  outliers  exceeds 
the  trimming  capability  of  either  estimator;  hence,  if  t  is  fixed  according  to  asymptotic 
theory  Pr  {R  >t  |  Ak }  will  be  large  and  close  to  1,  and  approximately 
Pfa =  £  P(Ak).  Assuming  that  the  contaminated  samples  come  from  a  switching 

k  >2(n  a|+l 

model,  the  probability  of  having  k  outliers  out  of  n  in  either  channel  is  binomial: 
Pk  =  |  (l-e)n~*  ek  ,  and  due  to  independence  of  the  quadrature  samples 

P(Ak)  =  Pk2  +  2Pk  £/>,  (5.31) 

1=0 

The  following  table  exhibits  this  approximation  for  n=16  and  e=0.1.  With  the  asymp¬ 
totically  optimal  <*=0.164,  Pfa  can  not  be  smaller  than  roughly  1CT3,  while  a— 0.27  is 
sufficient  for  10-6;  simulation  results  have  validated  this  approximation. 


a 

.125 

— I.J.— 

.25 

.3125 

.375 

5 

7 

9 

11 

13 

£  P(At) 

_ 

3.37  10-2 

1.0110-3 

1.18  TO-5 

5.41  TO-8 

8.3610-11 

Several  versions  of  scale- invariant  detectors  of  section  5.2  were  studied,  for  the 
nominally  Gaussian  case.  They  will  be  denoted  Dl  -  D5  as  follows: 

(Dl)  SW-SSQME  test  of  Eq.(5.5)  based  on  a-  trimmed  estimators  with  the  number  of 


reference  cells  M=A 
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(D‘2)  like  Dl  but  M—*o o  (i.e.,the  estimation  of  the  variance  of  the  test  statistic  con¬ 
verges  to  the  true  value).  For  that,  the  reference  statistics  W (u,v)  of  the  Dl  simu¬ 
lation  was  accumulated  over  all  runs  to  produce  the  adaptive  threshold.  Thus,  a 
huge  saving  was  possible. 

(D3)  SW-SSQME  test  with  a  “one-step”  location  estimators  of  Eq.(5.15)  based  on  the 
MAD  scale  estimate;  its  normalization  factor  D=0.632  was  adjusted  to  get  (empiri¬ 
cally)  consistency  of  the  scale  estimator  ES ^  (x)=l  at  the  Gaussian  (this  value  is 
somewhat  lower  than  predicted  from  the  asymptotic  theory  ).  The  nonlinearity  was 
the  soft  limiter  /  (x  ,-k  ,k  ). 

(D4)  SSQME  test  of  Eqs.  (5. 16)-(5. 17)  -  no  reference  cells  -  based  on  the  same  location 
and  scale  estimators  of  D3  . 

(D5)  SW  version  of  the  LORD  from  section  4.4  (obtained  from  weak  signal  assumptions 
by  replacing  the  M  -  estimators  with  the  nonlinearity-integrator  of  Eq.  (4.29)), 
M— *■  oo. 

For  D1-D4,  both  a  and  k  were  adjusted  to  obtain  better  false-alarm  control  as  discussed 
before;  for  D5  we  chose  k  as  the  optimal  for  the  given  t  from  the  asymptotic  theory. 

For  each  test,  two  simulation  programs  were  built.  The  first  is  for  H0  and  its  out¬ 
put  are  graphs  of  the  false  alarm  probability  vs.  the  normalized  threshold,  as  well  as  the 
first  two  moments  of  the  location  and  scale  estimators.  The  second  program  is  for  Hv 
It  simulates  the  detection  probability  as  a  function  of  the  SNR  for  several  levels  of  the 
false  alarm  probability  and  for  a  Swerling  I  target  model,  i.e.,  coherent  narrowband  sig¬ 
nal  with  Rayleigh  distributed  amplitude  and  uniform  phase  (see  section  4.3,  case  a).  In 
each  of  the  simulations,  the  test  was  subjected  to  four  different  noise  p.d.f.’s  from  an  e- 
mixture  family  where  the  nominal  is  normal  Gaussian: 

(/ 1)  «=o 

(/  2)  <==  0-1  and  two  point  masses  at  ±15 
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(/  3)  €~  0-1  and  a  Gaussian  contamination  with  variance  c2=100 
(/4)  e=0.01  and  a  Gaussian  contamination  with  c2=900 

In  the  sequel,  we  shall  use  these  numbers  to  identify  the  situation.  The  input  variances 
are  thus  1,  23.4,  10.9  and  9.99  respectively  -  all  cases  represent  large,  heavy  tailed 
contaminations  and  would  have  resulted  in  a  substantial  degradation  of  performance  if 
the  unrobustified  envelope  detector  was  employed  (see  Section  2.2).  The  e-  mixture  r.v.’s 
were  generated  from  a  switching  device:  at  each  step,  a  normal  r.v  yt-  was  generated 
together  with  a  binomial  r.v  s,-  that  is  equal  to  1  with  probability  e;  Xj  =yf  if  Sj^l 
and  Xj  =  c  -yt-  (  or  x,=±15  with  probability  0.5  for  /  2  )  otherwise.  Almost  all  simula¬ 
tions  were  for  n  =16. 

Table  5.5  displays  the  Monte-  Carlo  variances  of  the  cc-  trimmed  estimator  for  vari¬ 
ous  ads.  For  comparison,  from  the  asymptotic  theory  at  the  least-favorable  p.d.f. 
V  *(e=0.1)  =1.49,  and  for  <*=0.225  we  computed  from  Eq.(5.3) 
V(l)=1.17,  V(2)=1.51,  V(3)=1.46,  V(4)=1.2.  This  good  correspondence  between 
asymptotic  theory  and  finite  sample  simulation  (for  n=20)  was  observed  before  by 
Andrews  et.  al.  [35].  The  asymptotically  optimal  trimming  ratio  is  a(e=0.l)=  0.164;  as 
was  discussed  before  it  is  necessary  to  apply  more  trimming  with  small  sample  size  when 
very  small  false  alarm  probabilities  are  desired.  The  simulations  indicated  that 
a=0.225  to  0.3  is  sufficient;  the  table  shows  that  except  for  /  t  for  this  a  range  the  vari¬ 
ances  are  roughly  as  those  for  a  =  0.164  (or  even  smaller).  The  results  suggest  that  the 
detectability  loss  compared  to  theory  is  small.  This  has  been  verified  from  the  simu¬ 
lation  results,  which  will  be  presented  below.  Similar  conclusions  can  be  drawn  from 
Table  5.6,  which  is  for  the  “one-step”  estimator;  here  the  asymptotically  optimal  limit¬ 
ing  is  A(e=0.1)  =  1.14,  and  very  small  losses  (  if  any  )  are  incurred  by  taking  k  =0.75. 
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Table  5.5  Monte-Carlo  variances  of  the  a-  trimmed  estimator.  n=16,  20,000  runs. 
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Table  5.6  Monte-Carlo  variances  of  the  “one  -  step”  estimator  with  MAD  scale  estima¬ 
tor.  n=16,  20,000  runs. 

Probability  of  false  alarm  curves  are  shown  in  Figures  5.5  -  5.16.  In  all  these 
figures,  the  smooth  curves  marked  “T”  are  computed  from  the  asymptotic  theory  - 
Eqs.(5.6)  and  (5.23),  and  the  numbered  curves  are  the  simulation  results  for  the  densities 
f  i~  f  4-  Fig.  5.5  corresponds  to  Dl  and  5.6  to  D2,  with  <*=0.165.  We  see  clearly  that 
here  a  is  too  large,  since  for  the  more  contaminated  cases  Pja  is  much  higher  than 
desired.  This  problem  is  solved  by  taking  <*=0.225  as  can  be  seen  from  Figs.  5. 7-5. 8.  In 
the  first  one,  the  Monte-Carlo  curves  are  all  remarkably  close  to  the  theoretic  curve, 
within  the  range  where  the  simulation  results  are  reliable  ,  Pra  ^>4 jnr  =2  10-4.  The 
test  is  clearly  CFAR,  for  very  different  noise  p.d.f.’s.  In  Figure  5.8  Pja  is  even  some¬ 
what  smaller  than  the  theoretic,  (this  means  that  the  distribution  of  the  test  statistic  R 
has  shorter  tails  compared  to  the  one  sided  exponential).  Thus,  if  the  threshold  is  fixed 
according  to  the  asymptotic  theory,  Pja  is  bounded  from  above  as  desired.  The  same  is 
valid  for  a=0.3  as  can  be  seen  from  Figs.  5.9-10.  The  curves  for  D3  are  shown  in  Figs. 

The  empirical  distribution  for  nr  repetition  of  the  Monte-Carlo  experiment 

Fn  (x  )= - numberlx,-  >X  },  is  a  binomial  r.v.  with  mean=l-E(x)  and  variance 

'  U r  _ _ 

= - F  (x  )(1-F  (x  )).  Therefore,  for  Pfa  «l,for  \j'  Var  (Pfa  )/Pfa  <0.5  one  obtains 

^.">4  fnr. 
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5.11-12;  in  the  first  one,  the  limiter  breakpoint  is  k  =1.14  according  to  the  asymptotic 
theory  -  it  is  again  seen  that  the  limiting  is  not  “hard”  enough,  as  Pfa  for  /  3  is  higher 
than  the  theoretic  bound  when  Pfa  <0.01.  This  problem  is  corrected  by  taking  k  =0.75 
as  can  be  seen  from  Fig.  5.12. 

Results  for  D4  are  displayed  in  Figs.  5.13-5.14;  notice  the  two  sets  of  curves  in  Fig. 
5.13:  the  first  is  of  \ogProb  {R  (x,y)>(t  jn )  V *  W  (x,y)}  where  ^*=1.49,  which  should 
approach  a  straight  line  when  n  — ►oo  according  to  proposition  5.2.  In  the  second  case  the 
scale  is  contracted  by  taking  F*=3.  Here  asymptotic  theory  is  no  longer  valid:  the 
curves  are  not  linear  and  for  achieving  small  Pfa’s  higher  threshold  settings  are 
required.  These  higher  settings,  in  turn,  would  cause  higher  detectability  losses.  This 
deviation  from  asymptotic  theory  is  not  surprising,  and  is  equivalent  to  the  behavior  of 
the  “Quadrature  t-test,”  which  is  the  maximum  likelihood  test  for  a  coherent  nar¬ 
rowband  signal  in  Gaussian  noise  of  unknown  variance 

Hx 

(—  E  )2  +  (—  E  sfi  )2  <  -  E(*  -  J)2  +  E( »  -  y )2]  (5.32) 

n  n  H0  n 

For  small  n  ,  the  right  hand  side  cannot  be  approximated  by  a  Gaussian  distribution  (it 
is  chi-squared  for  all  n  ),  and  also  the  dependency  between  the  right  and  left  sides  of  Eq. 
(5.29)  cannot  be  neglected  as  we  did  in  proving  Proposition  5.2.  In  Fig.  5.14  n  was 
increased  to  48,  in  order  to  check  convergence.  It  is  evident  that  the  curves  almost  con¬ 
verge  to  asymptotic  theory,  although  some  curvature  is  still  noticed. 

Finally,  Fig.  5.15  is  for  the  LORD  test  D5.  Here,  as  in  Fig.  5.8,  a  tendency  for 
shorter  tails  is  observed.  This  could  be  heuristically  described  by  the  hard  limiting  that 
restricts  the  distribution  of  the  test  statistic  to  have  a  finite  support. 

Dramatic  improvement  in  false  alarms  control  over  the  unrobustified  SW  detector 
Eq.  (2.8)  is  evident  when  comparing  the  previous  figures  with  Fig.  5.16.  When  it  operates 
in  a  Gauss-Gauss  (e=0.1,  c=10)  noise,  Pfa  is  intolerably  increased  from  the  design 
1CT4  to  about  0.1  when  M  =4,  and  from  10-6  to  2T0-4  when  M— ►o o.  Compare  also  Fig. 
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2.5  for  <5=0.01 .  Thus  we  can  confidently  conclude  that  the  various  SW-SSQME  tests 
essentially  maintain  CFAR  according  to  the  asymptotic  theory  of  section  5.2  even  for  n 
as  small  as  16  with  Pfa  down  to  (at  least)  10~4.  For  the  test  without  reference  samples 
D4,  n  is  not  much  larger  than  48.  Based  on  these  curves,  the  detection  thresholds  for 
the  Hy  simulation  were  simply  taken  from  the  smooth  theoretical  curves,  and  extrapo¬ 
lated  from  them  for  Pf a=l0~6.  Future  work  could  utilize  reduced  variance  sampling 
techniques  such  as  “importance  sampling  ”[44]  to  validate  this  extrapolation. 

Probability  of  detection  curves  for  Dl,  D2  and  D5  are  shown  in  Figs.  5.17-27.  Here 
5000  repetitions  are  sufficient  to  get  Var1//2(Pd)/Pi< 0.1  for  ,02<Pd  <0.98.  The  ordi¬ 
nate  is  the  effective  integrated  SNR  =nE (A2)/a2  where  a1  is  the  variance  of  the  nomi¬ 
nal  Gaussian  p.d.f.  The  known  signal  frequency  case  is  depicted  in  Figs.  5.17-20.  The 
Monte-Carlo  curves  are  clearly  bounded  from  the  left  by  the  computed  Pd  for  the 
unrobustified  SW  detector  when  e— 0,  and  from  the  right  by  the  maximin  lower  bound 
computed  from  asymptotic  theory,  Eq.  (5.6),  with  V*  of  the  least-favorable  p.d.f.  /  * . 
Here  again  the  fit  between  asymptotic  theory  and  small  sample  performance  is  striking: 
the  curves  for  /  x-f  4  are  ranked  exactly  according  to  the  estimation  variances  of  Table 
5.5,  and  the  simulated  Pd  values  generally  differ  by  no  more  than  a  few  percent  when 
the  simulation  variances  are  substituted  in  Eq.  (5.6)  or  (5.23).  Equivalently,  the  horizon¬ 
tal  differences  between  curves  are  roughly  the  dB  values  of  the  variances  from  Table  5.5. 
We  also  note  that  the  difference  between  a=0.225  and  a=0.3  is  very  small,  again 
according  to  the  variance  difference. 

In  contrast  with  the  above  results,  the  unrobustified  SW  detector  of  Eq.(2.8)  was 
found  to  suffer  substantial  detectability  losses.  They  were  almost  identical  to  the 
increased  input  variances  of  /  2  -  /  4,  with  respect  to  /  j. 

Some  non- parametric  schemes  were  developed  for  detection  of  narrowband  signals 
in  unknown  noise;  the  narrowband  Wilcoxon  detector,  Eq.(5.29),  was  proposed  by  Car¬ 
lyle  and  its  Monte-Carlo  performance  in  purely  Gaussian  noise  was  studied  by  Hansen 
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[40].  It  was  found  that  the  SNR  losses  compared  to  the  UMP  detector  for  n=16  and 
Pfa  =  1CT6  are  5  and  14  dB,  for  Pd  =0.5  and  0.9,  respectively  (smaller  losses  are  incurred 
for  larger  n  )  -  compared  to  merely  0.6  db  for  D2  in  Fig.  5.18  !  Furthermore,  the  perfor¬ 
mance  in  non-Gaussian  noise  has  not  been  studied. 

The  performance  of  the  detector  without  reference  samples,  D  4  with  n  =  16,  is 
shown  in  Fig.  5.20.  Although  convergence  to  the  asymptotic  theory  is  not  yet  reached 
(in  accordance  with  the  higher  required  threshold  values  -  recall  Fig.  5.13),  the  detector 
is  clearly  robust.  This  is  evident  from  the  small  differences  between  the  curves  for 
/  !  —  /  4-  By  comparison  with  Fig.  5.18,  the  SNR  losses  are  roughly  1.8  and  5.8  dB  for 
P ja  =  10-2  and  10"6,  respectively,  independent  of  the  Pd  .  However,  when  comparing  it 
with  the  SW-SSQME  test  Dl  of  only  4  reference  channels,  D4  outperforms  it  by  roughly 
1  and  3.4  dB,  for  P/a  =  10-2  and  1CT6,  respectively.  By  virtue  of  the  shift  invariance  of 
the  one-step  location  estimator,  if  convergence  to  Gaussianity  is  reached  under  H0,  the 
performance  under  Hi  will  agree  with  the  asymptotic  prediction.  Thus,  when  n  >  50 
D4  would  hardly  suffer  any  detectability  losses  compared  to  the  UMP  detector  for  the 
least  favorable  noise  p.d.f. 

In  contrast  to  the  excellent  performance  of  the  structures  proposed  in  this  work, 
Fig.  5.21  shows  that  while  the  LORD  test  D5  performs  roughly  the  same  for  P j  a  =10  , 
it  is  inf  trior  for  Pja  =10-6;  its  SNR  should  be  higher  by  4.6  dB  to  achieve  Pd=  0.9  at 
/  3.  This  can  be  simply  explained:  when  the  SNR  is  sufficiently  high  for  high  Pd ,  the 
“non-tracking”  hard  limiter  of  Eq.  (4.29)  prevents  the  distribution  of  R  from  tilting 
towards  higher  values.  Also,  asymptotic  theory  does  not  properly  describe  its  behavior 
at  the  most  interesting  zone  Pd— *-1.  (Recall  that  SNR  — < >0  was  necessary  for  the  deriva¬ 
tion  of  its  asymptotic  distribution,  but  not  for  the  tests  which  are  based  on  true  robust 
estimation  of  the  arbitrary  amplitude.) 

The  performance  with  unmatched  signal  frequency  when  a  bank  of  n  contiguous 
SW-SSQME  tests  is  utilized  for  unknown  frequency  (section  5.3  ),  are  depicted  in  Figs. 
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5.22-24  where  the  normalized  deviation  A /  Jkn  d  //r  =0.25,  and  in  Figs.  5.25-27  for 
A /  =0.5  (halfway  between  the  central  frequency  of  adjacent  tests  ).  The  mismatch  loss 
is  almost  indistinguishable  from  that  of  Proposition  5.3  -  i.e.,  the  same  loss  incurred  by 
the  FFT  processor  for  a  frequency  unmatched  signal  in  pure  Gaussian  noise.  While  this 
loss  is  reasonable  for  Dl  and  D2,  0.91  dB  and  3.9  dB  for  A /  =0.25  and  0.5  respectively, 
the  weak  signal  locally  -  optimal  test  D5  totally  breaks  down  under  this  situation,  as 
demonstrated  by  Figs.  5.24  and  5.27.  We  note  that  this  loss  for  Dx  and  Z)2  can  be 
reduced  by  weighting  the  input  samples.  (See  appendix  D.) 

Another  interesting  conclusion  can  be  drawn.  In  many  practical  cases,  the  noise 
p.d.f.  is  variance  constrained,  as  it  is  proportional  to  the  total  (  nominal  plus  contam¬ 
inating  components  )  noise  power.  This  is  true  in  particular  for  the  clutter  (reverbera¬ 
tion)  environment,  where  the  noise  power  is  proportional  to  the  radar  (sonar)  transmit¬ 
ted  power.  A  question  now  arises  as  to  which  variance  constrained  noise  p.d.f.  consti¬ 
tutes  the  worst  detection  environment.  Consider  for  example  Fig.  5.18  and  /  3  for  which 
the  detector  D2  is  essentially  optimal.  However,  the  variance  of  /  3  is  10.9.  Conse¬ 
quently,  in  a  Gaussian  noise  environment  of  equal  power  <72=10.9,  the  UMP  detector 
would  perform  worse  by  10.9  -1.5  =  9.4  dB  than  the  detector  D2  !  This  turns  to  be  true 
in  general.  In  [45),  we  show  that  the  Gaussian  is  the  least  favorable  variance  constrained 
p.d.f.  for  detection  of  lowpass  as  well  as  narrowband  coherent  signals  with  random 


parameters. 
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Fig.  5.5  False  alarm  probability.  Test  -  Dl  ,  o=.164. 
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Fig.  5.8  False  alarm  probability.  Test  -  D2  ,  a=.164. 
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Fig.  5.9  False  alarm  probability.  Test  -  Dl  ,  a=.3. 


Fig.  5.10  False  alarm  probability.  Test  -  D2  ,  a=. 3. 
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Fig.  5.19  Detection  probability.  Test  -  Dl,  ar=.3,A /  =0 
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Fig.  5.20  Detection  probability.  Test  -  D4.  n  =  16,  A /  =0 
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Fig.  5.23  Detection  probability.  Test  -  D2,  a— .225, A f  —.25 
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Fig.  5.24  Detection  probability.  Test  -  D5,  K  =l.l4,Af  =  25 
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Appendix  A 


Small  Sample  Performance  of  the  Sign  Detector 

The  sign  detector  (SD)  is  the  simplest  and  probably  the  most  popular  non- 
parametric  detector  for  a  deterministic  lowpass  signal  in  noise  with  uncertain  symmetri¬ 
cal  density.  Under  several  assumptions,  it  is  known  to  be  the  UMP  detector  [32]  or  the 
minimax  LORD  detector  [5]  for  those  particular  situations.  Under  the  asymptotic  weak- 
signal/large-sample  [n  ~^oo,H 0)  assumptions,  the  ARE  with  respect  to  the  linear 
detector  (LD)  at  noise  density  /  (•)  is  ARESD  ^4(af  (0))2.  This  is  0.637  for  Gaussian 
noise,  2  for  Laplace  (double  exponential)  noise  and  in  between  for  many  other  densities. 
Hence,  the  SD  is  generally  considered  to  be  very  efficient  for  many  situations  of  uncer¬ 
tainty. 

Small  sample  analysis  reveals  that  it  may  not  perform  as  well,  as  the  fine  details  of 
the  actual  small  sample-size  error  probabilities  are  lost  in  the  asymptotic  analysis.  The 
sign  test  is  given  by: 


r  (x)  =  £  u  (*,- ) 
»'=1 


>  t  >  Hi 

=t  ,  Hx  with  probability  c  (A.l) 

<  f  ,  H0 

where  «(•)  is  the  unity  step  function.  Notice  that  a  randomized  test  is  required  since 
T{x)  is  a  discrete  binomially  distributed  r.v.  Assuming  symmetry  of  /  (•),  the  false 
alarm  probability  is  given  by: 


“» = <!>*  i 


s 

k  =/  +1 


n 

k 


+  c 


t 


t  <n  -1 


(A.2) 


c  ,  t  =n 

When  n  is  small  and  the  desired  a0  is  very  small,  the  second  row  of  (A.2)  must  be 

chosen,  and  therefore  the  detection  probability  is  also  proportional  to  the  randomization 

constant  c  which  might  be  very  small.  Denoting  ProbHi{xi  >0}  =  p  ,  it  is  given  by 


Pd  =  cpn  =  otQ T  p  1 


< 


<*02" 


SNR  -co 


(A.3) 
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The  critical  values  of  the  sample  size  such  that  for  n  <nc  the  detector  becomes 
very  poor  (Pj  <0.5,  even  for  SNR  — ►oc)  are  found  to  be  12,  19  and  26  for 
a0  =  10-4,  10~6,  10~8,  respectively;  these  numbers  are  typical  of  many  practical  systems. 
A  similar  situation  is  encountered  with  the  Wilcoxon  detector,  as  its  minimal  false  alarm 
probability  (without  randomization)  is  also  2~n  . 
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Appendix  B 

An  Approximation  for  the  Distribution  of  the 
Envelope  Detector 

In  this  appendix  an  approximation  is  derived  for  the  distribution  of  the  coherent 

n  n 

envelope  statistic:  R  =  I2  +  Q2,  I  =  (l/n )  £  ,  Q  =  (1/n )  Yj  Vi  ’  where  x  and  y 

i  1 

are  i.i.d.  vectors  which  are  also  independent  of  each  other.  When  the  samples  are  Gaus¬ 
sian,  R  is  one  sided  exponentially  distributed.  Thus,  we  are  interested  in  an  approxima¬ 
tion  which  exhibits  this  exponential  limiting  behavior. 

Formally,  any  univariate  density  /  ( x )  can  be  expanded  in  a  series  of  orthonormal 
functions  {$t(:r)}  as 

00 

/  (* )  =  E  Ck  *t  (* )  (B  1) 

k  =0 

which  is  L  2  convergent  when  all  moments  of  /  exist.  A  classical  expansion  in  terms  of 
the  Gaussian  density  and  its  derivatives  (which  are  proportional  to  Hermite  polynomials) 
is  the  type  A  Gram-Charlier  series  [36],  or  its  reordered  version,  the  Edgeworth’s  form 
which,  when  applied  to  the  distribution  of  the  samples  mean,  gives  the  best  order  in 
powers  of  n~ 1//2.  Conditions  for  convergence  of  the  infinite  Edgeworth  series  are  rather 
technical  and  implicit  (they  require  conditions  on  /  which  is  usually  unknown);  more¬ 
over,  they  are  of  little  value  from  the  statistical  viewpoint  where  the  important  question 
is  whether  a  small  number  of  terms  can  furnish  a  reasonable  approximation.  In  particu¬ 
lar,  while  the  Edgeworth  expansion  of  the  sample  mean  converges  to  the  Gaussian  den¬ 
sity  with  finite  terms  as  n  — *  oo,  when  the  expansion  is  carried  for  finite  n  ,  adding  more 
terms  in  the  series  above  some  optimal  number  usually  results  in  a  worse  fit.  See 
[36, Chap.  6]  and  [37, Chap.  16], 

As  pointed  out  in  a  footnote  [36, section  6.24],  when  an  approximation  for  the  distri- 
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bution  of  a  positive  random  variable  is  desired,  it  is  plausible  to  use  a  family  of  functions 
which  are  supported  only  on  [0,  oo).  In  this  way,  the  error  from  the  tails  of  the  Gaus¬ 
sian  density  and  its  derivatives  at  x  <  0  would  be  eliminated.  Since  in  our  case  the 
limiting  one  sided  exponential  distribution  belongs  to  the  Gamma  family,  the  appropri¬ 
ate  polynomials  are  those  of  Laguerre  defined  by  [38] 

£,*(*)  =  ^7-  7r(<-''")  (B.2) 

l  !  dx 


The  orthonormality  relation  is 


OO 


/  t-,‘LHx)L$x) 
0 


r(q  +1  +  i)  6 


hence  the  expansion  is 

00 

/(*)=  E  C,  e-x‘L,‘(x) 

i  = 0 

where 


(B.3) 


(B-4) 


Ci  = 


i ! 

T(a  +1  +  1) 


OO 


/  V(*  )  f  (x  )dx : 
o 


(B.5) 


The  first  four  polynomials  are 


LZ{x)  =  l  (B.6) 

L  ®  (x  )  =  1  +  a  —  x  (B.7) 

Ll(x)  =  -i-  [(d  +  1)  (a  +  2)  -2x(a  +  2)  +  a:2]  (B.8) 

LI  (x  )  —  4-  [(a  +1)  (a  +2)  (a  +3)  -  3x  (a  +2)  (a  +3)  +  3j2(a  +3)  -  x3]  (B.9) 

6 

Like  the  Edgeworth  expansion,  it  is  possible  to  eliminate  the  terms  i  =  1,  2  by  a 


transformation.  Defining  x=by,  the  following  expressions  are  obtained  from  (B.5)  and 
(B.6)-(B.9): 


co  = 

C,= 


1 

bT{a  +  1) 
1 

bT{a  +  2) 


[(a  +  1) 


(B.10) 

(B.ll) 


~  67(7+3)  1(a  +  2) (a  +  1} ■ 2(a  +  2)  T  +  v] 


(B- 12) 
(B.13) 


^3  —  £  +4j  [(a+3Xa+2)(a+l)-3(a+3)(a+2)yj  +3(a+3)-^|  -  -—} 

where  jt,-  =  Ex(x' ).  Solving  for  C  x  =  C2  =  0,  we  obtain: 


,  M2~Mi  a  1 

b  =  —tr  =  j 

a  =  M!2/(M2  -  Mi2)  ~  1  =  <* 
Hence,  upon  substituting  back  with  y  —  fix 


(B.14) 

(B.15) 


/  (»)  -  +  T(-q^3)  [(«  H-  2)#.a  -  /?M3]  0V "*  *' a"'1  (/?*  )  + 


(B.16) 


The  leading  term  is  recognized  as  the  Gamma  density  which,  as  a— ►!,  tends  to  the 
desired  exponential  density. 

Integrating  (B.16)  to  get  the  false  alarm  probability  Pr  {x  >  *Mi)  ==  Pr{y  >  ta) 
is  simplified  by  using  the  defining  expression  for  the  Laguerre  polynomials  (B.2).  Some 
tedious  algebra  yields: 

Pfa  =  Pr  {*  >  t  Mi)  =  [1  -  I  (a,  t  a)]  + 

+  Mi,  M3)e_a<  (<** )“  [2 at  (a  +  2)  -  (at  )2  -  (a  +  1)  (a  +  2))  +  •  •  •  (B.17) 

o 


where 


"  rTT!T  (a+1XQ+2) - T 

P  r(a  +  3)  Ml3 


(B.18) 


I  I 

r t  ±\  A  f  e~‘  t  1dt 

/(a'  at)  =  {  T(a)  ' 


(B.19) 


is  the  incomplete  Gamma  function.  It  is  clear  from  (B.17)  that  the  second  term  vanishes 
for  t  =0  as  well  as  for  t  =oo,  thus  fulfilling  some  of  the  properties  of  a  proper  distribu¬ 
tion  (since  the  first  term  is  a  proper  distribution).  However,  it  cannot  be  said  whether  or 
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not  (B.16)  can  assume  negative  values,  a  common  feature  of  all  finite  Gram-Charlier 
expansions.  The  following  term  in  the  expansion  turns  to  be  so  complicated  so  it  will 
not  be  given  here. 

To  get  specific  results  for  the  coherent  envelope  statistics  R  defined  in  the  begin¬ 
ning  of  the  appendix,  we  need  to  express  the  moments  of  R  in  terms  of  those  of  the  i.i.d. 

i  n  n 

samples,  E(R  m  )  =  — —  E  [(£>,-  )2  +  (ESi)2]™-  Using  the  moment  generating  func- 
nm  i  i 

tion,  tedious  but  straightforward  manipulations  yield 

jt*!  =  E{R)  =  -  of  (B.20) 

n 

M2  =  £(i?2)=ifi_[4+  3  (B.21) 

n z  n  a? 

K  =  E(R3)=^[24+^  +  ±(L-V>K)\  ,  L  =-—f  -15  (B.22) 

n  n-n  <7,- 

where  a2  —  E  (zi2),  rrij  —  E{x?)  and  K  is  the  coefficient  of  kurtosis.  When  are 
Gaussian,  both  K  and  L  obviously  vanish.  Substituting  these  into  (B.15)  and  (B.18)  we 
obtain: 

a  =  (l  +  K/2n)~x  (B.23) 

=  nSTaJ  l3“  +  2  -  “2<5  +  f  +  <B  24> 

Hence,  the  limiting  behavior  of  the  approximation  with  only  the  first  four  terms  of 
(B.4)  occurs  when  the  number  of  samples  n  is  much  greater  than  the  kurtosis 
(K /n  — *■  0).  Then  a  — ►  1  and  the  first  term  in  (B.17)  approaches  e-t  as  required  by  the 
central  limit  theorem,  though  (unlike  the  Edgeworth  expansion)  the  first  term  itself  actu¬ 
ally  includes  all  powers  of  K /n  in  a  highly  nonlinear  fashion.  When  a  <  1,  a  com¬ 
parison  with  tables  of  the  incomplete  Gamma  function  shows  that  for  sufficiently  large  t 
(such  that  the  nominal  false  alarm  probability  Pja  =  e~l  is  small),  the  outcome  of  Eq. 
(B.17)  could  be  orders  of  magnitude  higher  than  the  Pja  as  expected.  In  the  second 
term  of  (B.17),  all  the  expressions  tend  to  a  constant  when  cn— +1  except  C 3'  which 
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vanishes  identically  when  a  =  1. 

lim 


n 


For  other  values  of  a,  expansion  in  K  jn  gives 


C7  =—  +o(^-) 

n  12 


(B.25) 


For  the  normal-normal  mixture:  /  (x,-)  =  (1  -  e)  iV(0,  1)  -I-  eN( 0,  C2), 


=  3e(l  -  e) 


C2-  1 


12 


15 

2 


(B.26) 

(B.27) 


1  -  e  +  fC2  I  tC2  »  l  e 

I  =  15  i1  — +— -  1 

(1  -  e  4-  eC2)3  J  iC2»  l  e 

Therefore,  with  small  amounts  of  large  contamination  and  n  not  very  large,  a  can  be 
much  smaller  than  one  and  the  contribution  of  (B.24)  is  also  not  negligible. 


Actual  computation  of  (B.17),  using  equations  (B.20)-(B.24),  (B.26),  (B.27),  is  shown 
in  figures  2. 1-2.5  of  section  2.2  and  compared  to  Monte-Carlo  simulation  results.  For  a 
roughly  in  the  range  [0.5,1],  this  approximation  is  seen  to  be  quite  good.  It  is  still  a  rea¬ 
sonable  prediction  of  the  amount  of  increase  in  the  false  alarm  probability  down  to 


a  =  0.3. 
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APPENDIX  C 

The  Maximin  Test  for  a  Single  Observation 

For  a  single  observation  of  a  r.v.  R  which  comes  from  simple  mixture  hypotheses 
P0  and  Pj  as  in  Eq.  (1.1),  Huber’s  [2]  maximin  robust  test  takes  the  form 


d*\R) 


Hi 

with  probability  c 

Ho 


l(R  ;  L'  ,  L  "  )>  t 
1{R  \  L'  ,  L  "  )  =  t 
l(R  ;  L'  ,  L"  )  <  t 


(C.l) 


where  the  soft  limited  1{R,L'  ,  L"  )  is  given  by  Eqs.  (1.6-1. 8).  The  constants 
0  <  L'  <  L"<o o  are  found  by  solving 

PX{L  >L'  }  +  L  '  P o{L  <  L'  }  =  (1  -  e)-1  (C.2) 

P0{L  <L"  }  +  (l /L"  )P  x{L  >  L"  }  =  (1  -  e)"1  (C.3) 

where  L  (R  )  =  /  ±(R  )/  f  0(R  )  is  the  LR  under  the  nominal  situation,  and  is  assumed  to 
be  continuous..  Application  of  these  equations  to  the  distribution  of  the  coherent 
envelope  of  a  Rayleigh  narrowband  signal  in  narrowband  Gaussian  noise,  Eqs.  (3. 3-3. 4), 
results  in  Eqs.  (3. 6-3.7). 

Since  l(R  ,  L1  ,  L  "  )  is  a  monotone  increasing  function  of  L  (R  ),  which  in  turn  is 
monotonic  in  R  ,  Eq.  (C.l)  can  be  reformulated  as  a  randomized  test  on  R  .  The  thres¬ 
hold  and  the  randomization  constant  are  functions  of  the  least  favorable  p.d.f.’s.  Three 
different  cases  are  possible,  according  to  t  =  L'  ,  L1  <  t  <  L"  and  t  =  L  "  ,  see 

Figure  1.  Define  R '  and  R "  by  L(R'  )  =  L'  ,  L  (R "  )  =  L"  .  Define  also 

00  00 

a(x)  =  J  foir)dr,  Pix)  —  J  f  iir)dr  ■  Then, 

X  X 


Case  A 


t  =  L  '  =>  e  <  (a0  -  a{R  1  ))/(!  -  a{R  '  )) 


(C.4) 


d*(R)  = 


H i  ,  R  >  R1 

Hx  with  probability  c  ,  R  <  R1 


(C.5) 


where  0 <c  <1  is  given,  as  a  function  of  the  desired  a0  and  of  R  '  which  solves  (C.2),  by 


-  108  - 


a0  -  e  -  (1  -  e)ot(R '  ) 

C_  (1  -  0  (1  -a(R')) 

The  detection  probability  under  the  least-favorable  q  {  is 

0{d*,qt)  =  c  +(1  -£)(1  ~c)0{R'  )  =  1  -  (1  ~a0)L 


Case  B 


L  <t  <L"  =>  °°  ~  a(j? \  <  t  <  1 

1  -  a(R  '  )  1  -  a(R  "  ) 


Hl  R  >Rt 


where 


Here, 


d*(R)~  {  h0  ,  R  <Rt 


atQ  —  e  +  (1  —  e)  a(Rt ) 


Cage  C 


L"  =  t  =>  €  >  (a0  -  a{R  "  ))/(!  -  a(R  "  )) 


(C.6) 

(0.7) 

(0.8) 

(C.9) 

(C.10) 

(0-11) 

(0-12) 


where 


{Hi  with  probability  c 

b.  : 


R  >  R" 
R  <  R" 


(C.13) 


Here, 


_ <*o _ 

e  +  (1  -  e)  a(R  "  ) 


(C.14) 


«o(l  -f)P(R")  T  „ 
£+(!-£)  a{R  »  )  a°L 


(C.15) 


Notice  that  in  this  case  where  large  e  necessitates  randomized  maximin  test,  the  result¬ 
ing  maxim  in  lower  bound  on  /?,  Eq.  (C.15)  is  very  small,  as  it  is  proportional  to  the 
desired  a;0. 

The  proof  of  Eqs.  (C.4-C.15)  follows  from  (C.1-C.3).  For  Case  A,  (C.l)  is 
translated  into  (C.5)  as  a  result  of  the  above  mentioned  monotonicity  properties.  To 
satisfy  the  constraint  on  the  probability  of  false  alarm,  we  must  have 
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a0  =  Po{R  >R'}  +  c  Po{R  <  R'  } 

=  Pq{R'  <R  <  R"  }  +Pq{R  >R”  }  +  cPq{R  <  R 1  } 

Utilizing  the  expression  for  the  least  favorable  density  q q,  Eq.  (16), 

<*0  =  (1  -e)P0{R'  <R  <R "  }  +  c(l-e)P0{R  <R>  }  +  ^APx{R  >  R "  } 

The  last  term  is  equal  to  1  -  (1  -  e)P ${R  <  R"  },  by  virtue  of  Eq.  (C.3).  Collect¬ 
ing  terms,  we  obtain 

a0(l  -  «)(1  -  a(R'  ))(c  -  1)  +  1 

from  which  (C.6)  follows.  In  order  to  satisfy  c  >  0,  we  get  the  appropriate  e  range,  Eq. 
(C.4).  In  a  similar  manner,  from  Eq.  (1.2)  for  q  * , 

>R'  }  +  cPi{R  <R'  } 

=  (1  -e)P1{R  >R'  }  +(1  -e)  L'cP0{R  <  R '  } 

and  the  last  term,  by  virtue  of  Eq.  (C.2),  is  equal  to  c  [1  -  (l  -  t)P \{R  >  R  1  }]•  Col¬ 
lecting  terms,  the  first  equality  of  Eq.  (C.7)  is  obtained,  and  the  second  one  is  found  by 
substitution  of  (C.6)  and  using  (C.2)  again. 

Cases  B  and  C  are  proved  in  a  similar  manner. 
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Appendix  D 

Statistics  of  the  SSQME  Test  with  Unmatched  Frequency 


We  first  generalize  the  asymptotic  normality  of  M-estimators  for  a  sequence  of  r.v.’s 
which  are  not  identically  distributed. 

Lemma 

Let  /  Gty  of  Section  4.1  and  assume  also  that  /  is  bounded.  Let  {a:,  }  be  a  sequence  of 

1  Xi  -Ami 

independent  r.v.’s  with  p.d.f.  - /( - )  where  /  is  symmetric  and  absolutely 

<7,  <7,- 

_ _  A  n 

continuous,  and  lim  nA2=a2< oo.  Then  VnAn,  where  An  =arg  {  / (xj  — )=0}  is 


asymptotically  distributed  as  N (m  ,  V)  where 


m  = 


m(‘') 


(D.l) 


and 


— E£.(/2)  A 

v  =  —2 -  ,  E,(l2)  )—  (D.2) 

"i 

n 

Proof  The  proof  follows  Huber’s  [1]  with  some  modifications  to  account  for  the 
unequal  means  and  variances,  hence  it  will  only  be  briefly  sketched.  By  monotonicity  of 

'(0, 

lim  Prob  {An  \fn  <  k  }  =  lim  Prob  {£]/ (z,  -  kn _1//2)  <0}  (D.3) 

n  — »oo  n  — »oo 

Define  yn—l(xi~kn~1^).  Upon  expanding  in  a  Taylor  series  of  powers  of  n-1/2,  and 
invoking  the  assumed  symmetry  properties  of  /  and  /,  some  terms  vanish  and  one 
obtain  es: 

y„.)  =  -*W)  +  0(n_1)  (D.4I 

^iVar(fc.)  =  £,(/2)  +  0(»->)  (D.5) 

There  is  a  slight  complication  since  the  yn<  are  different  for  different  values  of  n  ,  and  the 
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usual  formulations  of  normal  convergence  do  not  apply.  However,  this  case  is  covered  by 
the  extended  Lindeberg  -  Feller  theorem  [41],  which  asserts  that  a  necessary  and 
sufficient  condition  for  convergence  to  a  standard  normal  distribution  of 

r«  O  r * 

sn  =  Yj{y„  -mn  )fcrn  with  an~Yjvni  is  ^at  each  summand  contributes  negligibly  to 

i = 1  '  i=l 

sn  in  the  sense  of  Lindeberg  condition 

vn 

lim  max  — —  =  0  (D.6) 

n  -*  00  1  <  «'  <  r„  o* 

Since  /(■)  is  bounded  and  v„  f^O,  <j2  grows  as  n  and  the  condition  is  satisfied.  Inter¬ 
changing  of  lim  and  Prob  {•}  is  allowed  since  {A„ }  converges.  Subtracting 

n  -*oo 

lim  Y]mn  lan=(m~k)IV  from  both  sides  of  the  argument  of  (D.3)  yields 

n  -*oo  ‘ 

lim  Prob  { An  ^<k)  =  (D.7) 

n  — *00  V 


We  now  apply  the  lemma  to  the  case  of  frequency  deviation  treated  in  section  5.3. 
The  means  of  the  quadrature  noise  components  are  given  by  E{x{)=  A  cos{4>+<t>i)  and 
E{yi  )=  A  sin(0+0,- )  where  — i2nfi/fr ,  and  the  variances  are  identical.  Denote 
the  M-estimators  by  I  and  Q  for  the  x  and  y  channels,  respectively.  They  are  jointly 
Gaussian  with  equal  variance  V  =E (l2)/ E\l')  (since  the  noise  components  have  identi¬ 
cal  variance),  and  their  means  are  E  [I)=\fnA  J]cos( <f>+<t>i )  and  E(Q)= 
\fn  A  5]sin(0+0i )  from  (D.l).  The  argument  of  the  exponent  of  the  joint  Gaussian  den¬ 
sity  is  thus 


I2+  Q2+E\I)+E\Q)-2{E{I)I  +E(Q)Q]  } 


The  middle  pair  does  not  depend  on  (f> 
1 2/ 


E  ^  +  E  (Q)  =[^cos(^+^. )]2  +  [£sin(^+0.)]2  = 


nA 


E* 


j  ) 


smlir  ft/ fT) 
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where  c,-  -^cos(^,- )  and  ^sin(^,- ),  by  straightforward  calculation.  Also, 

IE{I)  +  QE{Q )  =  v^T^co^^tan-H— )) 
vn  A  C 

where  C  +Q  Yjsi  and  5  A  -I^s{  +Q  £)c,-  ,  from  which  C2+S2— 

(I2jrQ2)G  (/ d)  follows  as  can  be  verified  by  substituting  for  the  definitions.  We  finally 
have 


f  {I, Q  |  A  »<A)=2^ex p 


(D.8) 

I2+Q2+nA2G(f  d)-2\/n  A  y/{ I2+Q2)G(fd  Jcos^-tan'^  ~) 

_  c 

A*? 


Averaging  this  with  respect  to  <f>  on  [0,2tt]  gives  the  same  expression  as  in  section  4.2, 
since  5  and  C  do  not  depend  on  <j>,  and  where  the  effective  SNR  is  replaced  by 
nA2G  (f  d)j  2V.  Hence,  the  detection  probabilities  for  the  various  SSQME  tests  of  Sec¬ 
tion  5.2  are  identical  with  the  case  / d=0  if  the  effective  SNR  is  attenuated  by  G(f  d). 

In  the  FFT  processor,  the  sidelobes  can  be  reduced,  and  the  response  can  be 
flattened  over  the  mainlobes  of  the  individual  filters  (  |  nf  d  /  f  r  |  <0.5),  at  a  price  of 
reduced  SNR  at  f  d  =  0,  by  appropriate  weighting  (e.g.,  Hamming’s): 

n 

{  x,y}— ►{wrx,wTy},  which  changes  G(fd)— *■  |  wk  exp(;  2nkf  d  //r )  |  2  The  same 

A=i 

can  be  done  with  the  SSQME  test.  The  derivation  of  the  test  statistic  is  similar  to  that 

above,  but  now  we  have  to  account  for  the  unequal  variances  as  a  consequence  of  the 

weighting.  The  frequency  response  will  be  given  by  G(fd)= 

I  Y!iwk'exp{j27rkfd/fr)  I  2>  where  wk' A  Ek  (/')/£■£*  (*')■  The  design  of  the  weighting 
k  k 

coefficients  is  more  complicated  than  in  the  linear  FFT  case;  first  {wk'  }  are  chosen  for 
the  desired  response,  and  then  J l'{wk  x  )/  ( x  )dx  =  wk'  has  to  be  solved  for  {wk  }. 
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ROBUST  DETECTION  OF  FADING  NARROW  -  BAND  SIGNALS 
IN  NON  -  GAUSSIAN  NOISE 

ERRATA 

(1)  Page  4,  the  end  of  the  third  paragraph  should  end  with:  VHSIC  technology  [16]. 

(2)  Page  19,  in  the  table  heading;a  diagonal  separator  is  missing.  The  first  entry  should 
read: 

(3)  Page  25,  second  line  before  Eq.(3.10),  (C.7)  should  be  changed  to  (C.ll). 

(4)  Page  32,  Eq.(4.2)  should  be  corrected  to 

^x(0=j4r?{E's.  /(a:t  --st  ^i)=0}  ,  ^  y(/  )=Arg  {  E  si 1  [Vi  ~si  )=0}  (4.2) 

1=1  i=  1 

1  n 

(5)  Page  35,  in  Eq.(4. 11),  the  definition  of  C  should  read:  C  A  lim  —  E  si2 

n  —  oo  Tl  j  _  j 

(6)  Page  44,  line  9,  *qalmost”  should  read:  "almost”. 

(7)  Page  61,  line  5  from  the  end,  should  finish  with:  technology  [16]. 


